A Systematic Literature Review of Retrieval-Augmented Generation: Techniques, Metrics, and Challenges
Brown, Andrew; Roman, Muhammad; Devereux, Barry
arXiv.org Artificial Intelligence
This systematic review of the research literature on retrieval-augmented generation (RAG) provides a focused analysis of the most highly cited studies published between 2020 and May 2025. A total of 128 articles met our inclusion criteria. The records were retrieved from ACM Digital Library, IEEE Xplore, Scopus, ScienceDirect, and the Digital Bibliography and Library Project (DBLP). RAG couples a neural retriever with a generative language model, grounding output in up-to-date, non-parametric memory while retaining the semantic generalisation stored in model weights. Guided by the PRISMA 2020 framework, we (i) specify explicit inclusion and exclusion criteria based on citation count and research questions, (ii) catalogue datasets, architectures, and evaluation practices, and (iii) synthesise empirical evidence on the effectiveness and limitations of RAG. To mitigate citation-lag bias, we applied a lower citation-count threshold to papers published in 2025 so that emerging breakthroughs with naturally fewer citations were still captured. This review clarifies the current research landscape, highlights methodological gaps, and charts priority directions for future research.
Sep-10-2025
- Country:
  - Asia
    - Indonesia > Bali (0.04)
    - Japan > Honshū
      - Kansai > Osaka Prefecture
        - Osaka (0.04)
      - Kantō > Tokyo Metropolis Prefecture
        - Tokyo (0.04)
      - Tōhoku > Iwate Prefecture
        - Morioka (0.04)
    - Middle East
      - Iran > Tehran Province
        - Tehran (0.04)
      - UAE (0.04)
    - Pakistan (0.04)
  - Europe
    - Italy > Calabria
      - Catanzaro Province > Catanzaro (0.04)
    - Lithuania (0.04)
    - Switzerland (0.04)
    - Ukraine > Kyiv Oblast
      - Kyiv (0.04)
    - United Kingdom > Northern Ireland (0.04)
  - South America > Peru
    - Cusco Department (0.04)
- Genre:
  - Instructional Material > Course Syllabus & Notes (1.00)
  - Overview (1.00)
  - Research Report
    - Experimental Study (1.00)
    - New Finding (0.86)
- Industry:
  - Banking & Finance (0.67)
  - Education > Educational Setting (0.67)
  - Energy (1.00)
  - Health & Medicine
    - Diagnostic Medicine > Imaging (1.00)
    - Health Care Technology (1.00)
    - Pharmaceuticals & Biotechnology (1.00)
    - Therapeutic Area
      - Cardiology/Vascular Diseases (1.00)
      - Infections and Infectious Diseases (0.92)
      - Neurology > Alzheimer's Disease (0.45)
  - Information Technology > Security & Privacy (1.00)
  - Law (1.00)
  - Leisure & Entertainment (1.00)
  - Media (0.67)
- Technology:
  - Information Technology > Artificial Intelligence
    - Cognitive Science > Problem Solving (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)
    - Natural Language
      - Chatbot (1.00)
      - Information Extraction (1.00)
      - Large Language Model (1.00)
      - Question Answering (1.00)
      - Text Processing (1.00)