Domain-Aware RAG: MoL-Enhanced RL for Efficient Training and Scalable Retrieval
Lin, Hao, Xie, Peitong, Chen, Jingxue, Lin, Jie, Tang, Qingkun, Lu, Qianchun
–arXiv.org Artificial Intelligence
Retrieval-Augmented Generation (RAG) systems rely heavily on the retrieval stage, particularly the coarse-ranking process. Existing coarse-ranking optimization approaches often struggle to balance domain-specific knowledge learning with query enhencement, resulting in suboptimal retrieval performance. To address this challenge, we propose MoLER, a domain-aware RAG method that uses MoL-Enhanced Reinforcement Learning to optimize retrieval. MoLER has a two-stage pipeline: a continual pre-training (CPT) phase using a Mixture of Losses (MoL) to balance domain-specific knowledge with general language capabilities, and a reinforcement learning (RL) phase leveraging Group Relative Policy Optimization (GRPO) to optimize query and passage generation for maximizing document recall. A key innovation is our Multi-query Single-passage Late Fusion (MSLF) strategy, which reduces computational overhead during RL training while maintaining scalable inference via Multi-query Multi-passage Late Fusion (MMLF). Extensive experiments on benchmark datasets show that MoLER achieves state-of-the-art performance, significantly outperforming baseline methods. MoLER bridges the knowledge gap in RAG systems, enabling robust and scalable retrieval in specialized domains.
arXiv.org Artificial Intelligence
Sep-9-2025
- Country:
- Asia
- China > Jiangsu Province
- Nanjing (0.05)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Singapore > Central Region
- Singapore (0.04)
- China > Jiangsu Province
- Europe > Spain
- North America
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- Massachusetts > Suffolk County
- Boston (0.04)
- New Mexico > Bernalillo County
- Albuquerque (0.04)
- New York > New York County
- New York City (0.04)
- Massachusetts > Suffolk County
- Mexico > Mexico City
- Asia
- Genre:
- Research Report > New Finding (1.00)
- Technology: