Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

Zhang, Yue, Li, Yafu, Cui, Leyang, Cai, Deng, Liu, Lemao, Fu, Tingchen, Huang, Xinting, Zhao, Enbo, Zhang, Yu, Chen, Yulong, Wang, Longyue, Luu, Anh Tuan, Bi, Wei, Shi, Freda, Shi, Shuming

Sep-24-2023–arXiv.org Artificial Intelligence

While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge. This phenomenon poses a substantial challenge to the reliability of LLMs in real-world scenarios. In this paper, we survey recent efforts on the detection, explanation, and mitigation of hallucination, with an emphasis on the unique challenges posed by LLMs. We present taxonomies of the LLM hallucination phenomena and evaluation benchmarks, analyze existing approaches aiming at mitigating LLM hallucination, and discuss potential directions for future research.

hallucination, llm, preprint arxiv, (16 more...)

arXiv.org Artificial Intelligence

Sep-24-2023

arXiv.org PDF

Add feedback

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America
  - United States
    - Illinois > Cook County
      - Chicago (0.04)
    - Colorado > Denver County
      - Denver (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - Portugal (0.04)
  - France (0.04)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - Croatia > Dubrovnik-Neretva County
    - Dubrovnik (0.04)
- Asia
  - China (0.04)
  - Indonesia > Bali (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)
  - Middle East
    - Jordan (0.04)
    - UAE (0.04)

Genre:
- Overview (1.00)
- Research Report > New Finding (0.46)
- Instructional Material > Course Syllabus & Notes (0.46)

Industry:
- Leisure & Entertainment (1.00)
- Health & Medicine (1.00)
- Media (0.92)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found