CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search

Neural Information Processing Systems 

The state-of-the-art approximate nearest neighbor search (ANNS) algorithm builds a large proximity graph on the dataset and performs a greedy beam search, which may bring many unnecessary explorations. We develop a novel framework, namely, based on random partitioning of the dataset. It produces a smaller sparse proximity graph for each partition and routing vectors that bind all the partitions. An efficient two-staged approach is designed for exploring, with fast approaching and cross-partition expansion. We theoretically prove that can accelerate the existing graph-based ANNS algorithms by reducing unnecessary explorations. In addition, we conduct extensive experiments on benchmark datasets. The experimental results confirm that the existing graph-based methods can be significantly outperformed by incorporating, achieving 1.5x to 2x speedups of in almost all recalls.