PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization
Ma, Xinbei, Gong, Yeyun, He, Pengcheng, Zhao, Hai, Duan, Nan
arXiv.org Artificial Intelligence
Building on the remarkable achievements of pre-trained language models in abstractive summarization, the copying mechanism has proved helpful for improving factuality, stability, and overall performance. This work proposes PROM, a new PhRase-level cOpying Mechanism that enhances attention on n-grams and can be applied to zero-shot summarization with pre-training. PROM adds an indicator layer that explicitly picks up tokens in n-grams that can be copied from the source, and computes an auxiliary loss for the copying prediction. Empirical studies show that PROM yields significant improvements when fine-tuned on benchmarks. In the zero-shot setting, PROM is used in self-supervised pre-training on raw corpora and provides new general baselines on a wide range of summarization datasets. Further analysis shows that PROM performs more reasonable copying and contributes to faithfulness.
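The core idea can be sketched in a few lines. The snippet below is a hypothetical reconstruction, not the authors' implementation: it labels each target token as "copyable" if it lies inside an n-gram that also appears verbatim in the source, then scores predicted copy probabilities against those labels with a binary cross-entropy auxiliary loss. Function names, the windowing scheme, and the loss form are all illustrative assumptions.

```python
import math

def copy_labels(source, target, n=3):
    """Mark target tokens inside an n-gram shared with the source.

    Illustrative stand-in for PROM's phrase-level copy indicator:
    a target token gets label 1 if any length-n window containing it
    appears verbatim in the source, else 0.
    """
    src_ngrams = {tuple(source[i:i + n]) for i in range(len(source) - n + 1)}
    labels = [0] * len(target)
    for i in range(len(target) - n + 1):
        if tuple(target[i:i + n]) in src_ngrams:
            for j in range(i, i + n):
                labels[j] = 1
    return labels

def copy_aux_loss(probs, labels, eps=1e-9):
    """Mean binary cross-entropy between copy probabilities and labels."""
    total = 0.0
    for p, y in zip(probs, labels):
        total += -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))
    return total / len(labels)

source = "the cat sat on the mat today".split()
target = "a cat sat on the mat".split()
labels = copy_labels(source, target, n=3)  # [0, 1, 1, 1, 1, 1]
loss = copy_aux_loss([0.5] * len(target), labels)
```

In a full model this auxiliary loss would be added to the standard generation loss, nudging the decoder's attention toward source phrases that should be copied intact.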
May-11-2023