Event-based evaluation of abstractive news summarization
You, Huiling, Touileb, Samia, Velldal, Erik, Øvrelid, Lilja
–arXiv.org Artificial Intelligence
An abstractive summary of a news article contains its most important information in a condensed version. The evaluation of automatically generated summaries by generative language models relies heavily on human-authored summaries as gold references, by calculating overlapping units or similarity scores. News articles report events, and ideally so should the summaries. In this work, we propose to evaluate the quality of abstractive summaries by calculating overlapping events between generated summaries, reference summaries, and the original news articles. We experiment on a richly annotated Norwegian dataset comprising both events annotations and summaries authored by expert human annotators. Our approach provides more insight into the event information contained in the summaries.
arXiv.org Artificial Intelligence
Jul-3-2025
- Country:
- Asia > Singapore (0.04)
- Europe
- Estonia > Harju County
- Tallinn (0.04)
- Norway > Eastern Norway
- Oslo (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Sweden (0.04)
- Estonia > Harju County
- North America
- Canada > Ontario
- Toronto (0.04)
- Dominican Republic (0.04)
- Canada > Ontario
- Genre:
- Research Report (1.00)
- Technology: