A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation

Neural Information Processing Systems 

Despite Retrieval-Augmented Generation (RAG) showing promising capability in leveraging external knowledge, a comprehensive evaluation of RAG systems is still challenging due to the modular nature of RAG, evaluation of long-form responses and reliability of measurements.