Can LLMs Be Trusted for Evaluating RAG Systems? A Survey of Methods and Datasets
