keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection

Vemula, Saketh Reddy, Krishnamurthy, Parameswari

May-26-2025–arXiv.org Artificial Intelligence

Identification of hallucination spans in black-box language model generated text is essential for applications in the real world. A recent attempt at this direction is SemEval-2025 Task 3, Mu-SHROOM-a Multilingual Shared Task on Hallucinations and Related Observable Over-generation Errors. In this work, we present our solution to this problem, which capitalizes on the variability of stochastically-sampled responses in order to identify hallucinated spans. Our hypothesis is that if a language model is certain of a fact, its sampled responses will be uniform, while hallucinated facts will yield different and conflicting results. We measure this divergence through entropy-based analysis, allowing for accurate identification of hallucinated segments. Our method is not dependent on additional training and hence is cost-effective and adaptable. In addition, we conduct extensive hyperparameter tuning and perform error analysis, giving us crucial insights into model behavior.

artificial intelligence, large language model, natural language, (16 more...)

arXiv.org Artificial Intelligence

May-26-2025

arXiv.org PDF

Add feedback

Country:
- North America > Panama (0.05)
- Oceania > Australia
  - New South Wales > Sydney (0.05)
- Europe
  - Sweden (0.04)
  - Finland > Uusimaa
    - Helsinki (0.04)

Genre:
- Research Report (0.64)

Industry:
- Leisure & Entertainment > Sports > Olympic Games (0.30)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found