Accelerated Variational Dirichlet Process Mixtures
Kurihara, Kenichi, Welling, Max, Vlassis, Nikos
–Neural Information Processing Systems
Dirichlet Process (DP) mixture models are promising candidates for clustering applications where the number of clusters is unknown a priori. Due to computational considerations these models are unfortunately unsuitable for large scale data-mining applications. We propose a class of deterministic accelerated DP mixture models that can routinely handle millions of data-cases. The speedup is achieved by incorporating kd-trees into a variational Bayesian algorithm for DP mixtures in the stick-breaking representation, similar to that of Blei and Jordan (2005). Our algorithm differs in the use of kd-trees and in the way we handle truncation: we only assume that the variational distributions are fixed at their priors after a certain level. Experiments show that speedups relative to the standard variational algorithm can be significant.
Neural Information Processing Systems
Dec-31-2007
- Country:
- Asia
- Japan (0.14)
- Middle East > Jordan (0.25)
- Europe > Netherlands (0.14)
- North America > United States (0.14)
- Asia
- Genre:
- Research Report (0.34)
- Technology: