Event-based evaluation of abstractive news summarization

You, Huiling, Touileb, Samia, Velldal, Erik, Øvrelid, Lilja

Jul-3-2025–arXiv.org Artificial Intelligence

An abstractive summary of a news article contains its most important information in a condensed version. The evaluation of automatically generated summaries by generative language models relies heavily on human-authored summaries as gold references, by calculating overlapping units or similarity scores. News articles report events, and ideally so should the summaries. In this work, we propose to evaluate the quality of abstractive summaries by calculating overlapping events between generated summaries, reference summaries, and the original news articles. We experiment on a richly annotated Norwegian dataset comprising both events annotations and summaries authored by expert human annotators. Our approach provides more insight into the event information contained in the summaries.

computational linguistic, large language model, natural language, (18 more...)

arXiv.org Artificial Intelligence

Jul-3-2025

arXiv.org PDF

Add feedback

Country:
- Asia > Singapore (0.04)
- North America
  - Dominican Republic (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - Sweden (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
  - Norway > Eastern Norway
    - Oslo (0.04)
  - Estonia > Harju County
    - Tallinn (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence > Natural Language
  - Text Processing (0.68)
  - Large Language Model (0.56)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found