Stars: Tera-Scale Graph Building for Clustering and Learning

Jan-17-2025, 03:34:17 GMT–Neural Information Processing Systems

A fundamental procedure in the analysis of massive datasets is the construction of similarity graphs. Such graphs play a key role for many downstream tasks, including clustering, classification, graph learning, and nearest neighbor search. For these tasks, it is critical to build graphs which are sparse yet still representative of the underlying data. The benefits of sparsity are twofold: firstly, constructing dense graphs is infeasible in practice for large datasets, and secondly, the runtime of downstream tasks is directly influenced by the sparsity of the similarity graph. In this work, we present Stars: a highly scalable method for building extremely sparse graphs via two-hop spanners, which are graphs where similar points are connected by a path of length at most two.

clustering and learning, graph, tera-scale graph building, (7 more...)

Neural Information Processing Systems

Jan-17-2025, 03:34:17 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (0.65)
  - Machine Learning (0.45)