Scalable Training of Mixture Models via Coresets
Neural Information Processing Systems
How can we train a statistical mixture model on a massive data set? In this paper, we show how to construct coresets for mixtures of Gaussians and natural generalizations. A coreset is a weighted subset of the data, which guarantees that models fitting the coreset will also provide a good fit for the original data set. We show that, perhaps surprisingly, Gaussian mixtures admit coresets of size independent of the size of the data set.
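To make the idea of a weighted subset concrete, the following is a minimal sketch, not the construction from the paper. It assumes a one-dimensional Gaussian mixture and uses plain uniform sampling with weights n/m as a naive stand-in for a coreset; the function names `gmm_log_likelihood` and `uniform_weighted_subset` are hypothetical. It compares the log-likelihood of one fixed model on the full data with the weighted log-likelihood on the subset. Uniform sampling carries no guarantee that the approximation holds for every candidate model, which is the property a true coreset provides.

```python
import numpy as np


def gmm_log_likelihood(X, weights, pis, mus, sigmas):
    """Weighted log-likelihood of 1-D points X under a Gaussian mixture."""
    X = np.asarray(X, dtype=float)[:, None]  # shape (n, 1)
    # Log of (mixing weight * component density) for every point and component.
    log_comp = (
        np.log(pis)
        - 0.5 * np.log(2.0 * np.pi * sigmas ** 2)
        - 0.5 * ((X - mus) / sigmas) ** 2
    )  # shape (n, k)
    # Log-sum-exp over components, shifted by the row maximum for stability.
    row_max = log_comp.max(axis=1, keepdims=True)
    log_px = row_max[:, 0] + np.log(np.exp(log_comp - row_max).sum(axis=1))
    return float(np.sum(weights * log_px))


def uniform_weighted_subset(X, m, rng):
    """Naive stand-in for a coreset: m uniform samples, each weighted n/m."""
    idx = rng.choice(len(X), size=m, replace=False)
    return X[idx], np.full(m, len(X) / m)


rng = np.random.default_rng(0)
# Synthetic data drawn from two Gaussians (illustrative values only).
X = np.concatenate([rng.normal(-2.0, 1.0, 50_000), rng.normal(3.0, 0.5, 50_000)])

# One fixed candidate model to evaluate on both data sets.
pis = np.array([0.5, 0.5])
mus = np.array([-2.0, 3.0])
sigmas = np.array([1.0, 0.5])

full = gmm_log_likelihood(X, np.ones(len(X)), pis, mus, sigmas)
C, w = uniform_weighted_subset(X, 2_000, rng)
approx = gmm_log_likelihood(C, w, pis, mus, sigmas)
rel_err = abs(approx - full) / abs(full)
print(f"full={full:.1f}  subset={approx:.1f}  relative error={rel_err:.4f}")
```

The weights sum to the size of the original data set, so the weighted objective on the subset is on the same scale as the objective on the full data.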
Mar-14-2024, 23:53:37 GMT