Extracting Keyphrases from Research Papers Using Citation Networks
Gollapalli, Sujatha Das (University of North Texas) | Caragea, Cornelia (University of North Texas)
Keyphrases for a document concisely describe the document using a small set of phrases. Keyphrases were previously shown to improve several document processing and retrieval tasks. In this work, we study keyphrase extraction from research papers by leveraging citation networks. We propose CiteTextRank for keyphrase extraction from research articles, a graph-based algorithm that incorporates evidence from both a document's content as well as the contexts in which the document is referenced within a citation network. Our model obtains significant improvements over the state-of-the-art models for this task. Specifically, on several datasets of research papers, CiteTextRank improves precision at rank 1 by as much as 9-20% over state-of-the-art baselines.
Jul-14-2014
- Country:
- North America > United States
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.14)
- Genre:
- Research Report > Promising Solution (0.48)
- Technology:
- Information Technology
- Information Management > Search (0.95)
- Data Science > Data Mining (0.70)
- Artificial Intelligence
- Representation & Reasoning (1.00)
- Machine Learning (1.00)
- Natural Language > Information Retrieval (0.47)
- Information Technology