Are Hallucinations Bad Estimations?

Hude Liu, Jerry Yao-Chieh Hu, Jennifer Yuntong Zhang, Zhao Song, Han Liu

arXiv.org Machine Learning 

We formalize hallucinations in generative models as failures to link an estimate to any plausible cause. Under this interpretation, we show that even loss-minimizing optimal estimators still hallucinate, and we confirm this with a general high-probability lower bound on the hallucination rate for generic data distributions. This reframes hallucination as a structural misalignment between loss minimization and human-acceptable outputs, and hence as estimation error induced by miscalibration. Experiments on coin aggregation, open-ended QA, and text-to-image generation support our theory.
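The claim that loss-minimizing estimators can hallucinate admits a minimal numerical sketch (our own illustration, not the paper's coin-aggregation setup): if the only plausible causes are a coin of bias 0 and a coin of bias 1, the squared-loss-optimal point estimate is the mean, roughly 0.5, which matches neither plausible cause.

```python
import random

# Illustrative sketch (assumed setup, not the paper's experiment):
# two equally likely causes -- a coin of bias 0.0 and a coin of bias 1.0.
random.seed(0)
biases = [0.0, 1.0]                                   # the only plausible causes
samples = [random.choice(biases) for _ in range(10_000)]

# The squared-loss-minimizing estimator is the sample mean.
estimate = sum(samples) / len(samples)                # near 0.5

# Distance from the estimate to the nearest plausible cause:
# it stays near 0.5, so the loss-optimal estimate corresponds
# to no plausible cause -- a "hallucinated" output in this framing.
gap = min(abs(b - estimate) for b in biases)
print(estimate, gap)
```

The point of the sketch is that the failure is structural: no amount of extra data shrinks the gap, because averaging over causes is exactly what squared-loss minimization demands.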