Sparse Stochastic Inference for Latent Dirichlet allocation
Mimno, David, Hoffman, Matt, Blei, David
We present a hybrid algorithm for Bayesian topic models that combines the efficiency of sparse Gibbs sampling with the scalability of online stochastic inference. We used our algorithm to analyze a corpus of 1.2 million books (33 billion words) with thousands of topics. Our approach reduces the bias of variational inference and generalizes to many Bayesian hidden-variable models.
Jun-27-2012
- Country:
- Europe > United Kingdom
- Scotland (0.14)
- North America > United States (0.46)
- Europe > United Kingdom
- Genre:
- Research Report > Experimental Study (0.47)