Unsupervised Summarization Re-ranking

Ravaut, Mathieu, Joty, Shafiq, Chen, Nancy

May-26-2023–arXiv.org Artificial Intelligence

With the rise of task-specific pre-training objectives, abstractive summarization models like PEGASUS offer appealing zero-shot performance on downstream summarization tasks. However, the performance of such unsupervised models still lags significantly behind that of their supervised counterparts. As in the supervised setup, we notice a very high variance in quality among summary candidates from these models, even though only one candidate is kept as the summary output. In this paper, we propose to re-rank summary candidates in an unsupervised manner, aiming to close the performance gap between unsupervised and supervised models. Our approach improves the unsupervised PEGASUS by up to 7.27% and ChatGPT by up to 6.86% in relative mean ROUGE across four widely adopted summarization benchmarks. It also achieves relative gains of 7.51% (up to 23.73% from XSum to WikiHow) averaged over 30 zero-shot transfer setups (fine-tuning on one dataset and evaluating on another).
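The abstract does not say how candidates are scored, so the following is only a minimal sketch of the general idea of reference-free re-ranking: each candidate is scored against the source document alone, and the highest-scoring one is kept. The scoring function (mean of unigram and bigram overlap F1 with the source) is a stand-in chosen for illustration, not the paper's method, and the names `overlap_f1`, `rerank`, and `relative_gain` are hypothetical. `relative_gain` shows how a relative improvement in a metric such as mean ROUGE is computed.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def overlap_f1(candidate, source, n):
    """F1 of n-gram overlap between a candidate summary and the source.

    Assumption: whitespace tokenization and lowercasing are good enough
    for a sketch; a real system would use a proper tokenizer.
    """
    cand = ngrams(candidate.lower().split(), n)
    src = ngrams(source.lower().split(), n)
    if not cand or not src:
        return 0.0
    matched = sum((cand & src).values())  # clipped n-gram matches
    if matched == 0:
        return 0.0
    precision = matched / sum(cand.values())
    recall = matched / sum(src.values())
    return 2 * precision * recall / (precision + recall)


def rerank(source, candidates):
    """Sort candidates by a reference-free score, best first.

    No reference summary is used, which is what makes this unsupervised.
    """
    def score(candidate):
        return (overlap_f1(candidate, source, 1)
                + overlap_f1(candidate, source, 2)) / 2

    return sorted(candidates, key=score, reverse=True)


def relative_gain(baseline, reranked):
    """Relative improvement of a metric, in percent."""
    return 100.0 * (reranked - baseline) / baseline


if __name__ == "__main__":
    source = "the cat sat on the mat while the dog slept by the door"
    candidates = [
        "a bird flew over the house",
        "the cat sat on the mat",
        "dogs are loyal animals",
    ]
    print(rerank(source, candidates)[0])
    print(relative_gain(40.0, 42.0))
```

In practice the candidates would come from a sampling or beam-search decoder, and the first element of the list returned by `rerank` would replace the model's default output.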
