$k$-Means Clustering for Persistent Homology
Cao, Yueqi, Leung, Prudence, Monod, Anthea
Persistent homology is a methodology central to topological data analysis that extracts and summarizes the topological features within a dataset as a persistence diagram; it has recently gained popularity through successful applications in many domains. However, its algebraic construction induces a metric space of persistence diagrams with a highly complex geometry. In this paper, we prove convergence of the $k$-means clustering algorithm on persistence diagram space and establish theoretical properties of the solution to the optimization problem in the Karush--Kuhn--Tucker framework. Additionally, we perform numerical experiments on various representations of persistent homology, including embeddings of persistence diagrams as well as diagrams themselves and their generalizations as persistence measures; we find that $k$-means clustering performed directly on persistence diagrams and persistence measures outperforms clustering on their vectorized representations.
Nov-25-2023
- Country:
- North America > United States
- Massachusetts > Suffolk County > Boston (0.04)
- Europe > United Kingdom
- England
- Greater London > London (0.04)
- Cambridgeshire > Cambridge (0.04)
- Asia > Japan
- Honshū > Chūgoku > Shimane Prefecture > Matsue (0.04)
- Genre:
- Research Report (1.00)
- Industry:
- Health & Medicine (0.93)
- Technology: