RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Huang, Jerry, Madala, Siddarth, Sidhu, Risham, Niu, Cheng, Hockenmaier, Julia, Zhang, Tong

Mar-16-2025–arXiv.org Artificial Intelligence

Recent research highlights the challenges retrieval models face in retrieving useful contexts and the limitations of generation models in effectively utilizing those contexts in retrieval-augmented generation (RAG) settings. To address these challenges, we introduce RAG-RL, the first reasoning language model (RLM) specifically trained for RAG. RAG-RL demonstrates that stronger answer generation models can identify relevant contexts within larger sets of retrieved information -- thereby alleviating the burden on retrievers -- while also being able to utilize those contexts more effectively. Moreover, we show that curriculum design in the reinforcement learning (RL) post-training process is a powerful approach to enhancing model performance. We benchmark our method on two open-domain question-answering datasets and achieve state-of-the-art results, surpassing previous SOTA generative reader models. In addition, we offers empirical insights into various curriculum learning strategies, providing a deeper understanding of their impact on model performance.

large language model, machine learning, qwen2, (18 more...)

arXiv.org Artificial Intelligence

Mar-16-2025

arXiv.org PDF

Add feedback

Country:
- Europe > Italy (0.28)
- North America
  - Canada (0.28)
  - United States (0.28)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Education > Curriculum (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.85)
  - Natural Language
    - Chatbot (0.85)
    - Large Language Model (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found