KPDrop: Improving Absent Keyphrase Generation
Chowdhury, Jishnu Ray, Park, Seoyeon, Kundu, Tuhin, Caragea, Cornelia
–arXiv.org Artificial Intelligence
Keyphrase generation is the task of generating phrases (keyphrases) that summarize the main topics of a given document. Keyphrases can be either present or absent from the given document. While the extraction of present keyphrases has received much attention in the past, only recently a stronger focus has been placed on the generation of absent keyphrases. However, generating absent keyphrases is challenging; even the best methods show only a modest degree of success. In this paper, we propose a model-agnostic approach called keyphrase dropout (or KPDrop) to improve absent keyphrase generation. In this approach, we randomly drop present keyphrases from the document and turn them into artificial absent keyphrases during training. We test our approach extensively and show that it consistently improves the absent performance of strong baselines in both supervised and resource-constrained semi-supervised settings.
arXiv.org Artificial Intelligence
Oct-24-2022
- Country:
- North America
- Dominican Republic (0.04)
- United States
- Maryland (0.04)
- Washington > King County
- Seattle (0.04)
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Illinois > Cook County
- Chicago (0.04)
- California > San Francisco County
- San Francisco (0.14)
- Canada > British Columbia
- Europe
- Sweden > Uppsala County
- Uppsala (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy > Tuscany
- Florence (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Sweden > Uppsala County
- Asia
- Middle East
- China
- Heilongjiang Province > Daqing (0.04)
- Beijing > Beijing (0.04)
- North America
- Genre:
- Research Report (0.81)
- Overview (0.68)
- Technology: