Affinity Clustering: Hierarchical Clustering at Scale
Mohammadhossein Bateni, Soheil Behnezhad, Mahsa Derakhshan, MohammadTaghi Hajiaghayi, Raimondas Kiveris, Silvio Lattanzi, Vahab Mirrokni
Neural Information Processing Systems
Graph clustering is a fundamental task in many data-mining and machine-learning pipelines. In particular, identifying a good hierarchical structure is both fundamental and challenging for several applications. The amount of data to analyze grows rapidly every day, so there is a need for new methods that efficiently compute effective hierarchical clusterings on such huge data sets. The main focus of this paper is on minimum spanning tree (MST) based clusterings. In particular, we propose affinity, a novel hierarchical clustering algorithm based on Borůvka's MST algorithm. We prove certain theoretical guarantees for affinity (as well as for some other classic algorithms) and show that in practice it is superior to several other state-of-the-art clustering algorithms.
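To make the link between Borůvka's MST algorithm and a cluster hierarchy concrete, the following is a minimal single-machine sketch. It is not the authors' implementation: it assumes an undirected weighted graph given as an edge list, and the function name `boruvka_hierarchy` and the tie-breaking rule are illustrative choices. In each round every current cluster selects its cheapest outgoing edge, the selected edges are contracted, and the resulting labelling is recorded as one level of the hierarchy.

```python
from typing import Dict, List, Tuple

Edge = Tuple[int, int, float]  # (u, v, weight); hypothetical input format


def boruvka_hierarchy(n: int, edges: List[Edge]) -> List[List[int]]:
    """Return one cluster-label list per Boruvka round, finest level first."""
    labels = list(range(n))  # every vertex starts as its own cluster
    levels: List[List[int]] = []
    while True:
        # Step 1: find the cheapest edge leaving each current cluster.
        best: Dict[int, Edge] = {}
        for u, v, w in edges:
            cu, cv = labels[u], labels[v]
            if cu == cv:
                continue  # edge is internal to a cluster
            for c in (cu, cv):
                # Ties are broken on (weight, u, v) so the chosen edges
                # cannot close a cycle.
                if c not in best or (w, u, v) < (best[c][2], best[c][0], best[c][1]):
                    best[c] = (u, v, w)
        if not best:
            break  # no edge crosses two clusters any more

        # Step 2: contract the chosen edges with a small union-find
        # over the current cluster ids.
        parent = {c: c for c in set(labels)}

        def find(c: int) -> int:
            while parent[c] != c:
                parent[c] = parent[parent[c]]  # path halving
                c = parent[c]
            return c

        for u, v, _ in best.values():
            ru, rv = find(labels[u]), find(labels[v])
            if ru != rv:
                parent[max(ru, rv)] = min(ru, rv)

        # Step 3: relabel vertices and record this level of the hierarchy.
        labels = [find(c) for c in labels]
        levels.append(labels[:])
    return levels


if __name__ == "__main__":
    toy_edges = [(0, 1, 1.0), (2, 3, 1.5), (1, 2, 5.0)]
    for depth, level in enumerate(boruvka_hierarchy(4, toy_edges), start=1):
        print(depth, level)
```

On the toy path graph above, the first round pairs vertices 0 with 1 and 2 with 3, and the second round merges the two pairs. Because every cluster that still has an outgoing edge merges with at least one other cluster in each round, a connected graph is reduced to a single cluster in a logarithmic number of rounds, which is what makes this round-based scheme attractive for large inputs.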
Oct-3-2024, 01:39:40 GMT