A Survey on Knowledge-Oriented Retrieval-Augmented Generation
Mingyue Cheng, Yucong Luo, Jie Ouyang, Qi Liu, Huijie Liu, Li Li, Shuo Yu, Bohou Zhang, Jiawei Cao, Jie Ma, Daoyu Wang, Enhong Chen
arXiv.org Artificial Intelligence
Retrieval-Augmented Generation (RAG) has gained significant attention in recent years for its potential to enhance natural language understanding and generation by combining large-scale retrieval systems with generative models. RAG leverages external knowledge sources, such as documents, databases, or structured data, to improve model performance and generate more accurate and contextually relevant outputs. This survey aims to provide a comprehensive overview of RAG by examining its fundamental components, including retrieval mechanisms, generation processes, and the integration between the two. We discuss the key characteristics of RAG, such as its ability to augment generative models with dynamic external knowledge, and the challenges associated with aligning retrieved information with generative objectives. We also present a taxonomy that categorizes RAG methods, ranging from basic retrieval-augmented approaches to more advanced models incorporating multi-modal data and reasoning capabilities. Additionally, we review the evaluation benchmarks and datasets commonly used to assess RAG systems, along with a detailed exploration of its applications in fields such as question answering, summarization, and information retrieval. Finally, we highlight emerging research directions and opportunities for improving RAG systems, such as enhanced retrieval efficiency, model interpretability, and domain-specific adaptations. This paper concludes by outlining the prospects for RAG in addressing real-world challenges and its potential to drive further advancements in natural language processing.
Mar-17-2025
- Country:
- North America
- United States
- District of Columbia > Washington (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Arizona > Maricopa County
- Scottsdale (0.04)
- Canada > Ontario
- Toronto (0.04)
- Europe
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Taiwan > Taiwan Province
- Taipei (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Middle East
- China > Liaoning Province
- Shenyang (0.04)
- Genre:
- Overview (1.00)
- Research Report > Promising Solution (0.45)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Banking & Finance (1.00)
- Government (0.92)
- Media (0.68)
- Leisure & Entertainment (0.68)
- Law (0.67)
- Health & Medicine > Therapeutic Area
- Neurology (0.45)
- Education > Educational Technology
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science > Problem Solving (1.00)
- Representation & Reasoning
- Natural Language
- Text Processing (1.00)
- Large Language Model (1.00)
- Information Retrieval (1.00)
- Chatbot (1.00)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (1.00)