Multilingual Anchoring: Interactive Topic Modeling and Alignment Across Languages
Yuan, Michelle, Durme, Benjamin Van, Ying, Jordan L.
–Neural Information Processing Systems
Multilingual topic models can reveal patterns in cross-lingual document collections. However, existing models lack speed and interactivity, which prevents adoption in everyday corpora exploration or quick moving situations (e.g., natural disasters, political instability). First, we propose a multilingual anchoring algorithm that builds an anchor-based topic model for documents in different languages. Then, we incorporate interactivity to develop MTAnchor (Multilingual Topic Anchors), a system that allows users to refine the topic model. We test our algorithms on labeled English, Chinese, and Sinhalese documents.
Neural Information Processing Systems
Feb-14-2020, 20:25:27 GMT
- Technology: