HaRiM$^+$: Evaluating Summary Quality with Hallucination Risk
Son, Seonil, Park, Junsoo, Hwang, Jeong-in, Lee, Junghwa, Noh, Hyungjong, Lee, Yeonsoo
–arXiv.org Artificial Intelligence
One of the challenges of developing a summarization model arises from the difficulty in measuring the factual inconsistency of the generated text. In this study, we reinterpret the decoder overconfidence-regularizing objective suggested in (Miao et al., 2021) as a hallucination risk measurement to better estimate the quality of generated summaries. We propose a reference-free metric, HaRiM+, which only requires an off-the-shelf summarization model to compute the hallucination risk based on token likelihoods. Deploying it requires no additional training of models or ad-hoc modules, which usually need alignment to human judgments. For summary-quality estimation, HaRiM+ records state-of-the-art correlation to human judgment on three summary-quality annotation sets: FRANK, QAGS, and SummEval. We hope that our work, which merits the use of summarization models, facilitates the progress of both automated evaluation and generation of summary.
arXiv.org Artificial Intelligence
Nov-24-2022
- Country:
- South America > Brazil (0.04)
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- Canada (0.04)
- United States
- South Carolina (0.04)
- Pennsylvania (0.04)
- Michigan (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts > Suffolk County
- Boston (0.04)
- Europe
- Germany > Berlin (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Italy > Tuscany
- Florence (0.04)
- Spain
- Galicia > Madrid (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Finland > Central Finland
- Jyväskylä (0.04)
- Ukraine > Kyiv Oblast
- Kyiv (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia > China
- Hong Kong (0.04)
- Heilongjiang Province > Daqing (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Law (1.00)
- Government > Military (0.68)
- Leisure & Entertainment > Sports
- Soccer (1.00)
- Health & Medicine
- Therapeutic Area (1.00)
- Consumer Health (0.67)
- Technology: