A Joint Learning Approach for Semi-supervised Neural Topic Modeling
Chiu, Jeffrey, Mittal, Rajat, Tumma, Neehal, Sharma, Abhishek, Doshi-Velez, Finale
Topic models are some of the most popular ways to represent textual data in an interpret-able manner. Recently, advances in deep generative models, specifically auto-encoding variational Bayes (AEVB), have led to the introduction of unsupervised neural topic models, which leverage deep generative models as opposed to traditional statistics-based topic models. We extend upon these neural topic models by introducing the Label-Indexed Neural Topic Model (LI-NTM), which is, to the extent of our knowledge, the first effective upstream semi-supervised neural topic model. We find that LI-NTM outperforms existing neural topic models in document reconstruction benchmarks, with the most notable results in low labeled data regimes and for data-sets with informative labels; furthermore, our jointly learned classifier outperforms baseline classifiers in ablation studies.
Apr-7-2022
- Country:
- North America > United States
- Oregon > Multnomah County
- Portland (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Oregon > Multnomah County
- Asia > Middle East
- Jordan (0.05)
- Israel (0.04)
- Palestine > Gaza Strip
- Gaza Governorate > Gaza (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.68)
- Technology: