Entropy Regularized Power k-Means Clustering

Chakraborty, Saptarshi, Paul, Debolina, Das, Swagatam, Xu, Jason

Jan-10-2020–arXiv.org Machine Learning

Despite its well-known shortcomings, $k$-means remains one of the most widely used approaches to data clustering. Current research continues to tackle its flaws while attempting to preserve its simplicity. Recently, the \textit{power $k$-means} algorithm was proposed to avoid trapping in local minima by annealing through a family of smoother surfaces. However, the approach lacks theoretical justification and fails in high dimensions when many features are irrelevant. This paper addresses these issues by introducing \textit{entropy regularization} to learn feature relevance while annealing. We prove consistency of the proposed approach and derive a scalable majorization-minimization algorithm that enjoys closed-form updates and convergence guarantees. In particular, our method retains the same computational complexity of $k$-means and power $k$-means, but yields significant improvements over both. Its merits are thoroughly assessed on a suite of real and synthetic data experiments.

algorithm, consistency, jnull 2, (17 more...)

arXiv.org Machine Learning

Jan-10-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - North Carolina > Durham County
    - Durham (0.04)
  - California > Alameda County
    - Oakland (0.04)
- Europe > United Kingdom
  - Scotland > City of Edinburgh
    - Edinburgh (0.04)
  - England > Cambridgeshire
    - Cambridge (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - India > West Bengal
    - Kolkata (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Health & Medicine (0.94)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found