OAEI-LLM-T: A TBox Benchmark Dataset for Understanding LLM Hallucinations in Ontology Matching Systems
–arXiv.org Artificial Intelligence
Hallucinations are inevitable in downstream tasks using large language models (LLMs). While addressing hallucinations becomes a substantial challenge for LLM-based ontology matching (OM) systems, we introduce a new benchmark dataset called OAEI-LLM-T. The dataset evolves from the TBox (i.e. schema-matching) datasets in the Ontology Alignment Evaluation Initiative (OAEI), capturing hallucinations of different LLMs performing OM tasks. These OM-specific hallucinations are carefully classified into two primary categories and six sub-categories. We showcase the usefulness of the dataset in constructing the LLM leaderboard and fine-tuning foundational LLMs for LLM-based OM systems.
arXiv.org Artificial Intelligence
Mar-25-2025
- Country:
- Europe
- Austria > Styria
- Graz (0.04)
- Germany > North Rhine-Westphalia
- Cologne Region > Bonn (0.04)
- Greece > Attica
- Athens (0.04)
- Italy (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Austria > Styria
- North America > United States
- Florida > Escambia County
- Pensacola (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Florida > Escambia County
- Oceania > Australia
- Australian Capital Territory > Canberra (0.04)
- Europe
- Genre:
- Research Report (0.52)
- Technology: