ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models

Neural Information Processing Systems 

We contend that utilizing existing relational databases is a promising approach to construct a benchmark that has both merits.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found