Accelerated Training of Large-Scale Gaussian Mixtures by a Merger of Sublinear Approaches
Florian Hirschberger, Dennis Forster, Jörg Lücke
We combine two recent lines of research on sublinear clustering to significantly increase the efficiency of training Gaussian mixture models (GMMs) on large-scale problems. First, we use a novel truncated variational EM approach for GMMs with isotropic Gaussians in order to increase clustering efficiency for large $C$ (many clusters). Second, we use recent coreset approaches to increase clustering efficiency for large $N$ (many data points). In order to derive a novel accelerated algorithm, we first show analytically how variational EM and coreset objectives can be merged to give rise to a new, combined clustering objective. Each iteration of the novel algorithm derived from this merged objective is then shown to have a run-time cost of $\mathcal{O}(N' G^2 D)$, where $N' < N$ is the coreset size, $G \ll C$ is the size of the cluster neighborhoods considered by the truncated variational approach, and $D$ is the data dimensionality.
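To make the merger concrete, a plausible form of the combined objective is a coreset-weighted truncated likelihood, sketched below under assumed notation ($\gamma_n$ for coreset weights, $\mathcal{K}_n$ for the truncated set of clusters considered for coreset point $\vec{x}_n$); this is an illustrative reconstruction, not the paper's exact formulation:

% Illustrative sketch (assumed notation): the truncated variational approach
% restricts each sum over clusters to a small subset K_n with |K_n| << C, while
% the coreset replaces the full sum over N data points by N' weighted points.
\[
  \mathcal{F}(\mathcal{K}, \Theta) \;=\; \sum_{n=1}^{N'} \gamma_n \,
    \log \sum_{c \in \mathcal{K}_n} \pi_c \,
    \mathcal{N}\!\bigl(\vec{x}_n;\, \vec{\mu}_c,\, \sigma_c^2 \mathbf{I}\bigr)
\]

Under this reading, each iteration touches $N'$ weighted points, evaluates on the order of $G^2$ candidate clusters per point when searching local cluster neighborhoods, and computes $D$-dimensional distances, which is consistent with the stated $\mathcal{O}(N' G^2 D)$ per-iteration cost.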
Oct-1-2018