OAEI-LLM-T: A TBox Benchmark Dataset for Understanding LLM Hallucinations in Ontology Matching Systems

Mar-25-2025, arXiv.org Artificial Intelligence

Hallucinations are inevitable in downstream tasks that use large language models (LLMs). Because addressing hallucinations has become a substantial challenge for LLM-based ontology matching (OM) systems, we introduce a new benchmark dataset called OAEI-LLM-T. The dataset evolves from the TBox (i.e., schema-matching) datasets in the Ontology Alignment Evaluation Initiative (OAEI) and captures the hallucinations of different LLMs performing OM tasks. These OM-specific hallucinations are carefully classified into two primary categories and six sub-categories. We showcase the usefulness of the dataset in constructing an LLM leaderboard and in fine-tuning foundational LLMs for LLM-based OM systems.

large language model, machine learning, natural language

Country:
- Oceania > Australia
  - Australian Capital Territory > Canberra (0.04)
- North America > United States
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
  - Florida > Escambia County
    - Pensacola (0.04)
- Europe
  - Italy (0.04)
  - United Kingdom > England
    - Greater London > London (0.04)
  - Greece > Attica
    - Athens (0.04)
  - Germany > North Rhine-Westphalia
    - Cologne Region > Bonn (0.04)
  - Austria > Styria
    - Graz (0.04)

Genre:
- Research Report (0.52)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
