OAEI-LLM: A Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching
Qiang, Zhangcheng, Taylor, Kerry, Wang, Weiqing, Jiang, Jing
–arXiv.org Artificial Intelligence
Hallucinations of large language models (LLMs) commonly occur in domain-specific downstream tasks, with no exception in ontology matching (OM). The prevalence of using LLMs for OM raises the need for benchmarks to better understand LLM hallucinations. The OAEI-LLM dataset is an extended version of the Ontology Alignment Evaluation Initiative (OAEI) datasets that evaluate LLM-specific hallucinations in OM tasks. We outline the methodology used in dataset construction and schema extension, and provide examples of potential use cases.
arXiv.org Artificial Intelligence
Nov-11-2024
- Country:
- Europe > Switzerland (0.04)
- North America > United States
- New York > New York County > New York City (0.04)
- Oceania > Australia
- Australian Capital Territory > Canberra (0.04)
- Victoria > Melbourne (0.04)
- Genre:
- Research Report (0.42)
- Technology: