Improved Beam Search for Hallucination Mitigation in Abstractive Summarization
Sridhar, Arvind Krishna, Visser, Erik
–arXiv.org Artificial Intelligence
Advancement in large pretrained language models has significantly improved their performance for conditional language generation tasks including summarization albeit with hallucinations. To reduce hallucinations, conventional methods proposed improving beam search or using a fact checker as a postprocessing step. In this paper, we investigate the use of the Natural Language Inference (NLI) entailment metric to detect and prevent hallucinations in summary generation. We propose an NLI-assisted beam re-ranking mechanism by computing entailment probability scores between the input context and summarization model-generated beams during saliency-enhanced greedy decoding. Moreover, a diversity metric is introduced to compare its effectiveness against vanilla beam search. Our proposed algorithm significantly outperforms vanilla beam decoding on XSum and CNN/DM datasets.
arXiv.org Artificial Intelligence
Nov-14-2023
- Country:
- Asia
- China > Hong Kong (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy > Tuscany
- Florence (0.04)
- Spain > Galicia
- Madrid (0.05)
- United Kingdom
- England
- Greater London > London (0.04)
- Greater Manchester > Manchester (0.04)
- Northern Ireland (0.04)
- Scotland (0.04)
- Wales (0.05)
- England
- Belgium > Brussels-Capital Region
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- United States
- Florida (0.05)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- New York > New York County
- New York City (0.04)
- Washington > King County
- Seattle (0.04)
- Canada
- Oceania > Australia (0.04)
- Asia
- Genre:
- Research Report (0.64)
- Industry:
- Government > Regional Government
- Leisure & Entertainment > Sports (0.98)
- Transportation (0.68)
- Technology: