Affinity Clustering: Hierarchical Clustering at Scale
Mohammadhossein Bateni, Soheil Behnezhad, Mahsa Derakhshan, MohammadTaghi Hajiaghayi, Raimondas Kiveris, Silvio Lattanzi, Vahab Mirrokni
Neural Information Processing Systems
Graph clustering is a fundamental task in many data-mining and machine-learning pipelines. In particular, identifying a good hierarchical structure is at the same time a fundamental and challenging problem for several applications. The amount of data to analyze is increasing at an astonishing rate each day. Hence there is a need for new solutions to efficiently compute effective hierarchical clusterings on such huge data. The main focus of this paper is on minimum spanning tree (MST) based clusterings. In particular, we propose affinity, a novel hierarchical clustering based on Borůvka's MST algorithm. We prove certain theoretical guarantees for affinity (as well as some other classic algorithms) and show that in practice it is superior to several other state-of-the-art clustering algorithms.
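To make the idea of a hierarchy built on Borůvka's MST algorithm concrete, here is a minimal single-machine sketch. It is not the authors' implementation; the function name `boruvka_hierarchy`, the edge format `(weight, u, v)`, and the choice to record one clustering level per Borůvka round are assumptions made purely for illustration. In each round every current cluster picks its cheapest outgoing edge, the chosen edges are merged, and the resulting labels are stored as one level of the hierarchy.

```python
def boruvka_hierarchy(n, edges):
    """Illustrative sketch: one clustering level per Boruvka round.

    n     -- number of vertices, labeled 0..n-1 (assumed input format)
    edges -- list of (weight, u, v) tuples for an undirected graph
    Returns a list of levels; each level maps vertex index -> cluster label.
    """
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    levels = []
    while True:
        # For every current cluster, find its cheapest outgoing edge.
        # Comparing whole (weight, u, v) tuples breaks ties consistently.
        best = {}
        for w, u, v in edges:
            ru, rv = find(u), find(v)
            if ru == rv:
                continue  # edge lies inside a single cluster
            for r in (ru, rv):
                if r not in best or (w, u, v) < best[r]:
                    best[r] = (w, u, v)

        if not best:
            break  # no cluster has an outgoing edge left

        # Merge clusters along the selected edges.
        for w, u, v in best.values():
            ru, rv = find(u), find(v)
            if ru != rv:
                parent[ru] = rv

        # Record the clustering after this round as one hierarchy level.
        levels.append([find(x) for x in range(n)])

    return levels


if __name__ == "__main__":
    # Two tight pairs joined by one heavier edge.
    example_edges = [(1, 0, 1), (2, 2, 3), (5, 1, 2)]
    for depth, labels in enumerate(boruvka_hierarchy(4, example_edges), 1):
        print(depth, labels)
```

On the small example graph, the first round groups the two tight pairs and the second round joins them, so the coarser levels correspond to later rounds. Because the number of clusters at least halves in every round on a connected graph, the number of levels grows only logarithmically with the number of vertices.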
Oct-3-2024, 01:39:40 GMT
- Country:
  - North America
    - United States
      - Maryland (0.04)
      - Nevada (0.04)
      - Washington > King County
        - Seattle (0.04)
      - New York > New York County
        - New York City (0.04)
      - Massachusetts > Middlesex County
        - Cambridge (0.04)
      - California
        - Santa Clara County > Palo Alto (0.04)
        - San Mateo County > Redwood City (0.04)
        - San Diego County > San Diego (0.04)
        - Los Angeles County > Long Beach (0.04)
        - Alameda County > Berkeley (0.04)
    - Canada
  - Europe
    - United Kingdom
      - Scotland > City of Edinburgh
        - Edinburgh (0.04)
      - England
        - Oxfordshire > Oxford (0.04)
        - Cambridgeshire > Cambridge (0.04)
    - Spain > Catalonia
      - Barcelona Province > Barcelona (0.04)
  - Asia
    - Middle East > Israel
      - Haifa District > Haifa (0.04)
    - Afghanistan > Parwan Province
      - Charikar (0.04)
- Industry:
  - Information Technology (0.46)
- Technology: