Neural Latent Extractive Document Summarization
Zhang, Xingxing, Lapata, Mirella, Wei, Furu, Zhou, Ming
arXiv.org Artificial Intelligence
Extractive summarization models require sentence-level labels, which are usually created heuristically (e.g., with rule-based methods) given that most summarization datasets only have document-summary pairs. Since these labels might be suboptimal, we propose a latent variable extractive model where sentences are viewed as latent variables and sentences with activated variables are used to infer gold summaries. During training, the loss comes directly from gold summaries. Experiments on the CNN/Dailymail dataset show that our model improves over a strong extractive baseline trained on heuristically approximated labels and also performs competitively with several recent models.
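To make the notion of heuristically created sentence labels concrete, the following is a minimal sketch of one common heuristic: greedily marking document sentences whose combined unigram overlap with the gold summary keeps improving. This is an illustrative assumption, not necessarily the rule-based method used in the paper. The function names `unigram_f1` and `greedy_labels`, the whitespace tokenization, and the `max_sentences` cap are all hypothetical choices made for this example.

```python
from collections import Counter


def unigram_f1(candidate_tokens, reference_tokens):
    """Unigram-overlap F1, used here as a rough stand-in for ROUGE-1 (assumption)."""
    if not candidate_tokens or not reference_tokens:
        return 0.0
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(candidate_tokens) & Counter(reference_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(candidate_tokens)
    recall = overlap / len(reference_tokens)
    return 2 * precision * recall / (precision + recall)


def greedy_labels(doc_sentences, summary, max_sentences=3):
    """Return a 0/1 label per sentence, chosen greedily against the gold summary."""
    reference = summary.lower().split()
    tokenized = [s.lower().split() for s in doc_sentences]
    selected, best = [], 0.0
    while len(selected) < max_sentences:
        scored = []
        for i in range(len(tokenized)):
            if i in selected:
                continue
            # Score the summary formed by the already selected sentences plus sentence i.
            candidate = [t for j in sorted(selected + [i]) for t in tokenized[j]]
            scored.append((unigram_f1(candidate, reference), i))
        if not scored:
            break
        score, idx = max(scored)
        if score <= best:
            break  # stop once no remaining sentence improves the overlap score
        selected.append(idx)
        best = score
    return [1 if i in selected else 0 for i in range(len(doc_sentences))]


doc = [
    "the cat sat on the mat",
    "stocks fell sharply on monday",
    "the dog chased the cat",
]
labels = greedy_labels(doc, "the cat sat on the mat while the dog chased the cat")
```

Labels produced this way depend entirely on the overlap heuristic, which is the kind of approximation the abstract describes as potentially suboptimal and the reason the proposed model takes its training loss from the gold summaries instead.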
Aug-28-2018
- Country:
  - North America
    - United States
      - Michigan > Washtenaw County
        - Ann Arbor (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Georgia > Fulton County
        - Atlanta (0.04)
      - California
        - San Francisco County > San Francisco (0.14)
        - Los Angeles County > Los Angeles (0.14)
        - San Diego County > San Diego (0.04)
      - Arizona > Maricopa County
        - Phoenix (0.04)
    - Puerto Rico > San Juan
      - San Juan (0.04)
    - Canada > British Columbia
  - Europe
    - Germany > Berlin (0.04)
    - Sweden > Uppsala County
      - Uppsala (0.04)
    - Spain
      - Valencian Community > Valencia Province
        - Valencia (0.04)
      - Catalonia > Barcelona Province
        - Barcelona (0.04)
    - Portugal > Lisbon
      - Lisbon (0.04)
  - Asia
    - Middle East > Jordan (0.04)
    - China > Beijing
      - Beijing (0.04)
- Genre:
  - Research Report (0.50)
- Technology: