HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection

May-27-2025, 14:26:57 GMT–Neural Information Processing Systems

The surge in applications of large language models (LLMs) has prompted concerns about the generation of misleading or fabricated information, known as hallucinations. Therefore, detecting hallucinations has become critical to maintaining trust in LLM-generated content. A primary challenge in learning a truthfulness classifier is the lack of a large amount of labeled truthful and hallucinated data. To address the challenge, we introduce HaloScope, a novel learning framework that leverages the unlabeled LLM generations in the wild for hallucination detection. Such unlabeled data arises freely upon deploying LLMs in the open world, and consists of both truthful and hallucinated information.

hallucination detection, haloscope, harnessing unlabeled llm generation

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)