HETFORMER: Heterogeneous Transformer with Sparse Attention for Long-Text Extractive Summarization

Liu, Ye, Zhang, Jian-Guo, Wan, Yao, Xia, Congying, He, Lifang, Yu, Philip S.

Oct-19-2021–arXiv.org Artificial Intelligence

To capture the semantic graph structure from raw text, most existing summarization approaches are built on GNNs with a pre-trained model. However, these methods suffer from cumbersome procedures and inefficient computations for long-text documents. To mitigate these issues, this paper proposes HETFORMER, a Transformer-based pre-trained model with multi-granularity sparse attentions for long-text extractive summarization. Specifically, we model different types of semantic nodes in raw text as a potential heterogeneous graph and directly learn heterogeneous relationships (edges) among nodes by Transformer. Extensive experiments on both single- and multi-document summarization tasks show that HETFORMER achieves state-of-the-art performance in Rouge F1 while using less memory and fewer parameters.

node, proceedings, summarization, (14 more...)

arXiv.org Artificial Intelligence

Oct-19-2021

arXiv.org PDF

Add feedback

Country:
- Asia > China (0.04)
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America > United States
  - Pennsylvania > Northampton County
    - Bethlehem (0.04)
  - Illinois > Cook County
    - Chicago (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Text Processing (1.00)
  - Machine Learning (1.00)