Diving Deep into Modes of Fact Hallucinations in Dialogue Systems
Das, Souvik, Saha, Sougata, Srihari, Rohini K.
arXiv.org Artificial Intelligence
Knowledge graph (KG)-grounded conversations often use large pre-trained models and usually suffer from fact hallucination. Frequently, entities with no references in the knowledge sources or conversation history are introduced into responses, hindering the flow of the conversation. Existing work attempts to overcome this issue by tweaking the training procedure or using a multi-step refining method. However, minimal effort has been put into constructing an entity-level hallucination detection system, which would provide fine-grained signals for controlling fallacious content while generating responses. As a first step toward addressing this issue, we dive deep to identify various modes of hallucination in KG-grounded chatbots through human feedback analysis. Second, we propose a series of perturbation strategies to create a synthetic dataset named FADE (FActual Dialogue Hallucination DEtection Dataset). Finally, we conduct comprehensive data analyses and create multiple baseline models for hallucination detection to compare against human-verified data and already established benchmarks.
Jan-11-2023
- Country:
- North America
- Dominican Republic (0.04)
- United States > New York
- New York County > New York City (0.04)
- Erie County > Buffalo (0.04)
- Europe
- Italy > Tuscany
- Florence (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Asia
- Singapore (0.04)
- China > Hong Kong (0.04)
- Middle East > Republic of Türkiye
- Batman Province > Batman (0.04)
- Genre:
- Research Report (0.50)
- Industry:
- Media > Film (0.46)
- Government (0.46)
- Leisure & Entertainment (0.46)
- Technology: