Keyphrase Extraction with Sequential Pattern Mining
Wang, Qingren (Hefei University of Technology) | Sheng, Victor S. (University of Central Arkansas) | Wu, Xindong (University of Louisiana)
Existing studies show that extracting a complete keyphrase candidate set is the first and crucial step to extract high quality keyphrases from documents. Based on a common sense that words do not repeatedly appear in an effective keyphrase, we propose a novel algorithm named KCSP for document-specific keyphrase candidate search using sequential pattern mining with gap constraints, which only needs to scan a document once and automatically specifies appropriate gap constraints for words without users’ participation. The experimental results confirm that it helps improve the quality of keyphrase extraction.
Feb-14-2017
- Country:
- North America > United States
- Arkansas > Faulkner County
- Conway (0.15)
- Louisiana > Lafayette Parish
- Lafayette (0.15)
- Arkansas > Faulkner County
- North America > United States
- Technology: