LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search Elias Jääsaari
–Neural Information Processing Systems
Approximate nearest neighbor (ANN) search is a key component in many modern machine learning pipelines; recent use cases include retrieval-augmented generation (RAG) and vector databases. Clustering-based ANN algorithms, that use score computation methods based on product quantization (PQ), are often used in industrial-scale applications due to their scalability and suitability for distributed and disk-based implementations. However, they have slower query times than the leading graph-based ANN algorithms. In this work, we propose a new supervised score computation method based on the observation that inner product approximation is a multivariate (multi-output) regression problem that can be solved efficiently by reduced-rank regression. Our experiments show that on modern high-dimensional data sets, the proposed reduced-rank regression (RRR) method is superior to PQ in both query latency and memory usage.
Neural Information Processing Systems
May-25-2025, 15:28:21 GMT
- Country:
- Asia (0.28)
- Europe > Finland (0.14)
- North America > United States (0.14)
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Education (0.46)
- Technology: