OAEI-LLM: A Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching

Qiang, Zhangcheng, Taylor, Kerry, Wang, Weiqing, Jiang, Jing

Nov-11-2024–arXiv.org Artificial Intelligence

Hallucinations of large language models (LLMs) commonly occur in domain-specific downstream tasks, with no exception in ontology matching (OM). The prevalence of using LLMs for OM raises the need for benchmarks to better understand LLM hallucinations. The OAEI-LLM dataset is an extended version of the Ontology Alignment Evaluation Initiative (OAEI) datasets that evaluate LLM-specific hallucinations in OM tasks. We outline the methodology used in dataset construction and schema extension, and provide examples of potential use cases.

hallucination, llm hallucination, mapping, (10 more...)

arXiv.org Artificial Intelligence

Nov-11-2024

arXiv.org PDF

Add feedback

Country:
- Europe > Switzerland (0.04)
- Oceania > Australia
  - Victoria > Melbourne (0.04)
  - Australian Capital Territory > Canberra (0.04)
- North America > United States
  - New York > New York County > New York City (0.04)

Genre:
- Research Report (0.42)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Ontologies (1.00)
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.51)