Interpretable Topic Extraction and Word Embedding Learning using row-stochastic DEDICOM
Hillebrand, Lars, Biesner, David, Bauckhage, Christian, Sifa, Rafet
–arXiv.org Artificial Intelligence
The DEDICOM algorithm provides a uniquely interpretable matrix factorization method for symmetric and asymmetric square matrices. We employ a new row-stochastic variation of DEDICOM on the pointwise mutual information matrices of text corpora to identify latent topic clusters within the vocabulary and simultaneously learn interpretable word embeddings. We introduce a method to efficiently train a constrained DEDICOM algorithm and a qualitative evaluation of its topic modeling and word embedding performance.
arXiv.org Artificial Intelligence
Jul-23-2025
- Country:
- Africa > Senegal
- Kolda Region > Kolda (0.04)
- Asia > Middle East
- Europe
- Belgium (0.04)
- France (0.04)
- Germany (0.04)
- Serbia (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- United Kingdom
- England > Greater London
- Wales (0.04)
- North America
- Canada (0.04)
- United States
- Colorado > Boulder County
- Boulder (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.04)
- Colorado > Boulder County
- South America > Argentina (0.04)
- Africa > Senegal
- Genre:
- Research Report (0.50)
- Industry:
- Leisure & Entertainment > Sports > Soccer (0.48)
- Technology: