ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models
–Neural Information Processing Systems
We contend that utilizing existing relational databases is a promising approach to construct a benchmark that has both merits.
Neural Information Processing Systems
Nov-19-2025, 00:21:47 GMT
- Country:
- Africa
- Middle East > Algeria
- Oum el-Bouaghi Province > Oum el Bouaghi (0.04)
- South Africa > Western Cape
- Cape Town (0.04)
- Middle East > Algeria
- Asia
- China (0.04)
- Indonesia > Bali (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- Middle East > Qatar (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- South Korea > Incheon
- Incheon (0.04)
- Europe
- North America
- Canada > British Columbia
- Vancouver (0.04)
- Costa Rica (0.04)
- United States (0.46)
- Canada > British Columbia
- Oceania > Australia
- New South Wales (0.04)
- South America
- Argentina (0.04)
- Brazil (0.04)
- Chile > Santiago Metropolitan Region
- Santiago Province > Santiago (0.04)
- Africa
- Genre:
- Research Report (0.87)
- Industry:
- Leisure & Entertainment > Sports
- Soccer (1.00)
- Media > Film (1.00)
- Transportation
- Air (0.94)
- Infrastructure & Services > Airport (0.69)
- Leisure & Entertainment > Sports
- Technology: