Topic Taxonomy Expansion via Hierarchy-Aware Topic Phrase Generation
Lee, Dongha; Shen, Jiaming; Lee, Seonghyeon; Yoon, Susik; Yu, Hwanjo; Han, Jiawei
arXiv.org Artificial Intelligence
Topic taxonomies display hierarchical topic structures of a text corpus and provide topical knowledge to enhance various NLP applications. To dynamically incorporate new topic information, several recent studies have tried to expand (or complete) a topic taxonomy by inserting emerging topics identified in a set of new documents. However, existing methods focus only on frequent terms in documents and the local topic-subtopic relations in a taxonomy, which leads to limited topic term coverage and fails to model the global topic hierarchy. In this work, we propose a novel framework for topic taxonomy expansion, named TopicExpan, which directly generates topic-related terms belonging to new topics. Specifically, TopicExpan leverages the hierarchical relation structure surrounding a new topic and the textual content of an input document for topic term generation. This approach encourages newly inserted topics to cover important but less frequent terms while keeping their relations consistent within the taxonomy. Experimental results on two real-world text corpora show that TopicExpan significantly outperforms other baseline methods in terms of the quality of output taxonomies.
Oct-18-2022
- Country:
  - Asia (1.00)
  - Europe (0.67)
  - North America > United States
    - Minnesota (0.28)
- Genre:
  - Research Report (1.00)
- Industry:
  - Consumer Products & Services > Food, Beverage, Tobacco & Cannabis
    - Beverages (0.68)
  - Leisure & Entertainment > Sports
    - Basketball (0.67)
    - Soccer (0.93)
  - Media (0.93)
- Technology: