PELMS: Pre-training for Effective Low-Shot Multi-Document Summarization
Peper, Joseph J., Qiu, Wenzhao, Wang, Lu
arXiv.org Artificial Intelligence
We investigate pre-training techniques for abstractive multi-document summarization (MDS), which is much less studied than summarizing single documents. Though recent work has demonstrated the effectiveness of highlighting information salience for pre-training strategy design, such methods struggle to generate abstractive and reflective summaries, properties that are critical for MDS. To this end, we present PELMS, a pre-trained model that uses objectives based on semantic coherence heuristics and faithfulness constraints over unlabeled multi-document inputs to promote the generation of concise, fluent, and faithful summaries. To support the training of PELMS, we compile MultiPT, a multi-document pre-training corpus of over 93 million documents organized into more than 3 million unlabeled topic-centric document clusters, covering diverse genres such as product reviews, news, and general knowledge. We perform an extensive evaluation of PELMS in low-shot settings on a wide range of MDS datasets. Our approach consistently outperforms competitive baselines with respect to overall informativeness, abstractiveness, coherence, and faithfulness.
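The abstract does not describe how MultiPT's topic-centric clusters are built, so the following is only a toy illustration of the general idea of grouping unlabeled documents by topical similarity: a greedy pass that assigns each document to the first cluster whose representative exceeds a bag-of-words cosine threshold. The function names and the threshold value are hypothetical and are not taken from the PELMS pipeline.

```python
import math
from collections import Counter


def bow(text):
    """Bag-of-words term counts for a document (toy tokenization)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two Counter term-count vectors."""
    num = sum(a[t] * b[t] for t in set(a) & set(b))
    den = math.sqrt(sum(v * v for v in a.values())) * \
          math.sqrt(sum(v * v for v in b.values()))
    return num / den if den else 0.0


def cluster_documents(docs, threshold=0.2):
    """Greedily group documents into topic-centric clusters.

    Each cluster is represented by its first document's vector;
    a document joins the first cluster it is similar enough to,
    otherwise it starts a new cluster. Returns lists of indices.
    """
    reps, clusters = [], []
    for i, doc in enumerate(docs):
        vec = bow(doc)
        for c_idx, rep in enumerate(reps):
            if cosine(vec, rep) >= threshold:
                clusters[c_idx].append(i)
                break
        else:
            reps.append(vec)
            clusters.append([i])
    return clusters


docs = [
    "the camera battery life is great for this camera",
    "battery life and camera quality impressed reviewers",
    "the soccer match ended with a late goal",
    "a late goal decided the soccer match final",
]
print(cluster_documents(docs))  # → [[0, 1], [2, 3]]
```

A production pipeline over 93 million documents would instead use dense embeddings and a scalable clustering method, but the grouping objective, unlabeled documents partitioned into topically coherent multi-document inputs, is the same.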
Nov-16-2023