A Geometric Approach to $k$-means
Hong, Jiazhen, Qian, Wei, Chen, Yudong, Zhang, Yuqian
–arXiv.org Artificial Intelligence
\kmeans clustering is a fundamental problem in many scientific and engineering domains. The optimization problem associated with \kmeans clustering is nonconvex, for which standard algorithms are only guaranteed to find a local optimum. Leveraging the hidden structure of local solutions, we propose a general algorithmic framework for escaping undesirable local solutions and recovering the global solution or the ground truth clustering. This framework consists of iteratively alternating between two steps: (i) detect mis-specified clusters in a local solution, and (ii) improve the local solution by non-local operations. We discuss specific implementation of these steps, and elucidate how the proposed framework unifies many existing variants of \kmeans algorithms through a geometric perspective. We also present two natural variants of the proposed framework, where the initial number of clusters may be over- or under-specified. We provide theoretical justifications and extensive experiments to demonstrate the efficacy of the proposed approach.
arXiv.org Artificial Intelligence
Oct-24-2025
- Country:
- Asia
- Japan (0.04)
- Middle East > Jordan (0.04)
- North America > United States
- California > San Mateo County
- Menlo Park (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- California > San Mateo County
- Asia
- Genre:
- Research Report (1.00)
- Technology: