Angular Quantization-based Binary Codes for Fast Similarity Search Yunchao Gong Google Research, New York, NY10011, USA
–Neural Information Processing Systems
This paper focuses on the problem of learning binary codes for efficient retrieval of high-dimensional non-negative data that arises in vision and text applications where counts or frequencies are used as features. The similarity of such feature vectors is commonly measured using the cosine of the angle between them. In this work, we introduce a novel angular quantization-based binary coding (AQBC) technique for such data and analyze its properties. In its most basic form, AQBC works by mapping each non-negative feature vector onto the vertex of the binary hypercube with which it has the smallest angle. Even though the number of vertices (quantization landmarks) in this scheme grows exponentially with data dimensionality d, we propose a method for mapping feature vectors to their smallest-angle binary vertices that scales as O(d log d). Further, we propose a method for learning a linear transformation of the data to minimize the quantization error, and show that it results in improved binary codes. Experiments on image and text datasets show that the proposed AQBC method outperforms the state of the art.
Neural Information Processing Systems
Mar-14-2024, 20:36:05 GMT
- Country:
- Asia > Afghanistan
- Parwan Province > Charikar (0.04)
- North America > United States
- Illinois > Champaign County
- Champaign (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.04)
- North Carolina > Orange County
- Chapel Hill (0.04)
- Illinois > Champaign County
- Asia > Afghanistan
- Technology: