Coordinated Topic Modeling
Akash, Pritom Saha; Huang, Jie; Chang, Kevin Chen-Chuan
arXiv.org Artificial Intelligence
We propose a new problem, coordinated topic modeling, which imitates how humans describe a text corpus. It treats a set of well-defined topics as the axes of a semantic space, each with a reference representation, and uses those axes to model a corpus so that the resulting representation is easy to understand. The task makes corpus representations more interpretable by reusing existing knowledge, and it benefits corpus comparison. We design ECTM, an embedding-based coordinated topic model that uses the reference representation to capture aspects specific to the target corpus while preserving each topic's global semantics. ECTM introduces topic- and document-level supervision with a self-training mechanism to solve the problem. Extensive experiments on multiple domains show that our model outperforms the baselines.
Oct-22-2022
- Country:
  - North America > United States
    - Illinois (0.04)
    - Florida (0.04)
    - New York > New York County
      - New York City (0.04)
  - Europe
    - United Kingdom (0.04)
    - Ireland (0.04)
    - Russia > North Caucasian Federal District
      - Chechen Republic (0.04)
  - Asia
    - Middle East > Jordan (0.05)
    - India (0.04)
    - Afghanistan (0.04)
    - Russia (0.04)
    - Japan (0.04)
    - China (0.04)
- Genre:
  - Research Report (0.64)
- Industry:
  - Government
    - Voting & Elections (0.67)
    - Regional Government (0.67)
- Technology: