MetaKP: On-Demand Keyphrase Generation
Wu, Di; Shen, Xiaoxian; Chang, Kai-Wei
arXiv.org Artificial Intelligence
Traditional keyphrase prediction methods predict a single set of keyphrases per document, failing to cater to the diverse needs of users and downstream applications. To bridge the gap, we introduce on-demand keyphrase generation, a novel paradigm that requires keyphrases that conform to specific high-level goals or intents. For this task, we present MetaKP, a large-scale benchmark comprising four datasets, 7500 documents, and 3760 goals across news and biomedical domains with human-annotated keyphrases. Leveraging MetaKP, we design both supervised and unsupervised methods, including a multi-task fine-tuning approach and a self-consistency prompting method with large language models. The results highlight the challenges of supervised fine-tuning, whose performance is not robust to distribution shifts. By contrast, the proposed self-consistency prompting approach greatly improves the performance of large language models, enabling GPT-4o to achieve 0.548 SemF1, surpassing the performance of a fully fine-tuned BART-base model. Finally, we demonstrate the potential of our method to serve as a general NLP infrastructure, exemplified by its application in epidemic event detection from social media.
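As a rough illustration of the self-consistency idea mentioned in the abstract, the sketch below aggregates several independently sampled keyphrase lists by voting. It is not the paper's method: the function name `aggregate_keyphrases`, the `min_fraction` threshold, the whitespace/case normalization, and the sample data are all assumptions made for this example, and the step that would obtain the samples from a language model is left out.

```python
from collections import Counter


def aggregate_keyphrases(samples, min_fraction=0.5):
    """Keep phrases that appear in at least `min_fraction` of the sampled lists.

    `samples` is a list of keyphrase lists, e.g. one list per sampled
    model response (hypothetical input; no model is called here).
    """
    votes = Counter()
    for sample in samples:
        # Count each phrase at most once per sample, after light normalization
        # (lowercasing and collapsing runs of whitespace).
        votes.update({" ".join(p.lower().split()) for p in sample})
    threshold = min_fraction * len(samples)
    kept = [(p, n) for p, n in votes.items() if n >= threshold]
    # Most-voted first; ties broken alphabetically so the output is deterministic.
    kept.sort(key=lambda item: (-item[1], item[0]))
    return [p for p, _ in kept]


# Made-up samples standing in for three sampled responses to the same goal.
samples = [
    ["Vaccine rollout", "public health"],
    ["vaccine  rollout", "side effects"],
    ["vaccine rollout", "public health", "funding"],
]
print(aggregate_keyphrases(samples))  # ['vaccine rollout', 'public health']
```

Raising `min_fraction` trades recall for precision: at 1.0 only phrases present in every sample survive.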
Jun-28-2024
- Country:
  - Asia
    - China > Heilongjiang Province
      - Daqing (0.04)
    - Japan > Honshū
      - Chūbu > Aichi Prefecture
        - Nagoya (0.04)
      - Kansai > Osaka Prefecture
        - Osaka (0.04)
      - Kantō > Tokyo Metropolis Prefecture
        - Tokyo (0.14)
    - Middle East > UAE
      - Abu Dhabi Emirate > Abu Dhabi (0.04)
    - Singapore (0.04)
    - Thailand > Chiang Mai
      - Chiang Mai (0.04)
  - Europe
  - North America
    - Canada > British Columbia
    - United States
      - Alaska (0.04)
      - California
        - Fresno County > Fresno (0.04)
        - Los Angeles County > Los Angeles (0.14)
      - Connecticut (0.05)
      - Ohio > Lucas County
        - Toledo (0.04)
      - Pennsylvania (0.04)
      - Washington > King County
        - Seattle (0.04)
  - Pacific Ocean > North Pacific Ocean
    - Prince William Sound (0.04)
- Genre:
  - Research Report (0.82)
- Industry:
  - Education > Educational Setting (0.68)
  - Government > Regional Government
  - Health & Medicine
    - Pharmaceuticals & Biotechnology (0.68)
    - Therapeutic Area (1.00)
  - Law (1.00)
- Technology: