Visualizing Topic Models
Chaney, Allison June-Barlow (Princeton University) | Blei, David M. (Princeton University)
Managing large collections of documents is an important problem for many areas of science, industry, and culture. Probabilistic topic modeling offers a promising solution. Topic modeling is an unsupervised machine learning method that learns the underlying themes in a large collection of otherwise unorganized documents. This discovered structure summarizes and organizes the documents. However, topic models are high-level statistical tools: a user must scrutinize numerical distributions to understand and explore their results. In this paper, we present a method for visualizing topic models. Our method creates a navigator of the documents, allowing users to explore the hidden structure that a topic model discovers. These browsing interfaces reveal meaningful patterns in a collection, helping end-users explore and understand its contents in new ways. We provide an open-source implementation of our method.
Feb-22-2012
- Country:
  - Asia > Middle East > Jordan (0.05)
  - North America > United States > New Jersey > Mercer County > Princeton (0.04)
  - North America > United States > New York (0.04)
  - South America > Paraguay
- Technology: