$\texttt{COSMIC}$: Mutual Information for Task-Agnostic Summarization Evaluation

Darrin, Maxime, Formont, Philippe, Cheung, Jackie Chi Kit, Piantanida, Pablo

Mar-1-2024–arXiv.org Artificial Intelligence

Assessing the quality of summarizers poses significant challenges. In response, we propose a novel task-oriented evaluation approach that assesses summarizers based on their capacity to produce summaries that are useful for downstream tasks, while preserving task outcomes. We theoretically establish a direct relationship between the resulting error probability of these tasks and the mutual information between source texts and generated summaries. We introduce $\texttt{COSMIC}$ as a practical implementation of this metric, demonstrating its strong correlation with human judgment-based metrics and its effectiveness in predicting downstream task performance. Comparative analyses against established metrics like $\texttt{BERTScore}$ and $\texttt{ROUGE}$ highlight the competitive performance of $\texttt{COSMIC}$.
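The link between downstream error and mutual information can be illustrated with a textbook argument. The sketch below is an assumption-laden illustration, not necessarily the bound derived in the paper: the symbols $T$ (source text), $S$ (generated summary), $Y$ (a downstream task label taking values in a finite set $\mathcal{Y}$ with $|\mathcal{Y}| \ge 2$), and $P_e$ (the error probability of any classifier that predicts $Y$ from $S$ alone) are introduced here for illustration, and the summary is assumed to depend on the label only through the source text, i.e. $Y \to T \to S$ forms a Markov chain. With entropies measured in bits, the mutual information between sources and summaries is

$$
I(T;S) = H(T) - H(T \mid S),
$$

and combining the weakened form of Fano's inequality with the data-processing inequality $I(Y;S) \le I(T;S)$ gives

$$
P_e \;\ge\; \frac{H(Y \mid S) - 1}{\log_2 |\mathcal{Y}|}
\;=\; \frac{H(Y) - I(Y;S) - 1}{\log_2 |\mathcal{Y}|}
\;\ge\; \frac{H(Y) - I(T;S) - 1}{\log_2 |\mathcal{Y}|}.
$$

Under these assumptions, a summarizer whose outputs share little information with their sources cannot support a low error rate on a task whose label is uncertain a priori, which is the intuition behind scoring summarizers by mutual information.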

information, machine learning, natural language

Country:
- Asia > Middle East
  - Qatar (0.14)
- Europe (0.46)
- North America > Canada
  - Quebec (0.14)

Genre:
- Research Report > New Finding (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (0.46)
  - Natural Language (1.00)
