Differentiable N-gram Objective on Abstractive Summarization
Yunqi Zhu, Xuebing Yang, Yuanyuan Wu, Mingjin Zhu, Wensheng Zhang
arXiv.org Artificial Intelligence
ROUGE is a standard automatic evaluation metric based on n-grams for sequence-to-sequence tasks, whereas cross-entropy loss, an essential objective for neural network language models, optimizes at the unigram level. We present differentiable n-gram objectives that attempt to alleviate the discrepancy between the training criterion and the evaluation criterion. The objective maximizes the probabilistic weight of matched sub-sequences. The novelty of our work is that the objective weights the matched sub-sequences equally and does not cap the number of matched sub-sequences at the ground-truth count of n-grams in the reference sequence. We jointly optimize the cross-entropy loss and the proposed objective, which yields a decent ROUGE score improvement on the abstractive summarization datasets CNN/DM and XSum and outperforms alternative n-gram objectives.
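The idea in the abstract can be illustrated with a rough sketch. This is not the authors' formulation, which the abstract does not spell out; it is one plausible reading, under these assumptions: `probs[t][v]` is the model's probability of vocabulary item `v` at decoding step `t`, a "matched sub-sequence" is scored by the product of the probabilities of its tokens, every (decoding position, reference n-gram) pair counts equally, and no clipping by reference counts is applied. The names `ngram_match_reward`, `joint_loss`, and the mixing weight `alpha` are hypothetical.

```python
import math


def ngram_match_reward(probs, reference, n):
    """Expected number of n-gram matches (illustrative assumption, not the paper's exact objective).

    For every decoding position i and every reference n-gram starting at j,
    add the product of the probabilities that steps i..i+n-1 emit
    reference[j..j+n-1]. Every pair has weight 1 and the sum is not capped
    by how often the n-gram occurs in the reference.
    """
    total = 0.0
    for i in range(len(probs) - n + 1):
        for j in range(len(reference) - n + 1):
            weight = 1.0
            for k in range(n):
                weight *= probs[i + k][reference[j + k]]
            total += weight
    return total


def joint_loss(probs, reference, n=2, alpha=1.0):
    """Token-level cross-entropy minus a scaled n-gram reward (hypothetical combination)."""
    steps = min(len(probs), len(reference))
    # Clamp probabilities so that log(0) cannot occur.
    ce = -sum(math.log(max(probs[t][reference[t]], 1e-12)) for t in range(steps))
    ce /= max(steps, 1)
    # Normalize the reward by the number of n-gram positions in the output.
    positions = max(len(probs) - n + 1, 1)
    reward = ngram_match_reward(probs, reference, n) / positions
    return ce - alpha * reward


if __name__ == "__main__":
    reference = [0, 1, 2]  # token ids of a toy reference summary
    uniform = [[1 / 3] * 3 for _ in range(3)]
    confident = [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1], [0.1, 0.1, 0.8]]
    print(joint_loss(uniform, reference), joint_loss(confident, reference))
```

The sketch uses plain Python lists so that it runs without dependencies. Because the reward is built only from products and sums of probabilities, the same expression written with tensors in an automatic differentiation framework would be differentiable with respect to the model outputs.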
Dec-25-2022
- Country:
  - North America > United States
    - California
      - San Bernardino County (0.04)
      - Los Angeles County (0.04)
  - Europe > United Kingdom
    - England (0.04)
  - Asia > China
    - Hainan Province > Haikou (0.04)
    - Guangdong Province > Guangzhou (0.04)
    - Beijing > Beijing (0.04)
- Genre:
  - Research Report (0.64)
- Industry:
  - Government > Regional Government (0.94)
  - Law Enforcement & Public Safety (0.68)
- Technology: