MIRAGE: Assessing Hallucination in Multimodal Reasoning Chains of MLLM

Jun-13-2026, 22:11:42 GMT–Neural Information Processing Systems

Multimodal hallucination in multimodal large language models (MLLMs) restricts the correctness of MLLMs. However, multimodal hallucinations are multi-sourced and arise from diverse causes. Existing benchmarks fail to adequately distinguish between perception-induced hallucinations and reasoning-induced hallucinations. This failure constitutes a significant issue and hinders the diagnosis of multimodal reasoning failures within MLLMs. To address this, we propose the MIRAGE benchmark, which isolates reasoning hallucinations by constructing questions where input images are correctly perceived by MLLMs yet reasoning errors persist.

artificial intelligence, large language model, natural language, (6 more...)

Neural Information Processing Systems

Jun-13-2026, 22:11:42 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)