PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization
Wu, Jiayi, Cai, Hengyi, Yan, Lingyong, Sun, Hao, Li, Xiang, Wang, Shuaiqiang, Yin, Dawei, Gao, Ming
arXiv.org Artificial Intelligence
The emergence of retrieval-augmented generation (RAG) has alleviated the problems of outdated and hallucinated content in the output of large language models (LLMs), yet RAG still has notable limitations. When a general-purpose LLM serves as the RAG generator, it often falls short in response informativeness, response robustness, and citation quality. Past approaches to these limitations either add steps beyond response generation or optimize the generator through supervised fine-tuning (SFT), and neither fully aligns the generator with RAG requirements. Consequently, optimizing the RAG generator from multiple preference perspectives while keeping its end-to-end LLM form remains a challenge. To bridge this gap, we propose Multiple Perspective Preference Alignment for Retrieval-Augmented Generation (PA-RAG), a method for optimizing the generator of RAG systems to align comprehensively with RAG requirements. Specifically, we construct high-quality instruction fine-tuning data and multi-perspective preference data by sampling responses of varied quality from the generator under prompts whose retrieved documents differ in quality. Subsequently, we optimize the generator using SFT and Direct Preference Optimization (DPO). Extensive experiments on four question-answering datasets across three LLMs demonstrate that PA-RAG significantly enhances the performance of RAG generators. Our code and datasets are available at https://github.com/wujwyi/PA-RAG.
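As a rough illustration of the DPO step mentioned in the abstract, the sketch below computes the standard DPO loss for a single preference pair from summed sequence log-probabilities. It is not taken from the PA-RAG code: the function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions made for this example, and the abstract does not state the hyperparameters the authors used.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) response pair.

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    beta (assumed value) scales how strongly the policy is pushed away
    from the reference.
    """
    # Implicit rewards: how much more likely each response is under the
    # policy than under the reference model.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_margin - rejected_margin)

    # loss = -log(sigmoid(logits)), written in a numerically stable form.
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))


# Hypothetical log-probabilities: the policy favors the chosen response
# more than the reference does, so the loss falls below log(2).
loss = dpo_loss(-1.0, -3.0, -2.0, -2.0)
```

In a multi-perspective setup such as the one the abstract describes, each preference pair would contrast a better and a worse response along one perspective (informativeness, robustness, or citation quality); how those pairs are built and ordered is specific to the paper and is not shown here.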
Dec-18-2024