SuperVLAD: Compact and Robust Image Descriptors for Visual Place Recognition
–Neural Information Processing Systems
These features are suitable for large-scale VPR and robust against viewpoint changes. However, the VLAD-based aggregation methods usually learn a large number of ( e.g., 64) clusters and their corresponding cluster centers, which directly leads to a high dimension of the yielded global features. More importantly, when there is a domain gap between the data in training and inference, the cluster centers determined on the training set are usually improper for inference, resulting in a performance drop. To this end, we first attempt to improve NetVLAD by removing the cluster center and setting only a small number of ( e.g., only 4) clusters.
Neural Information Processing Systems
Nov-13-2025, 12:33:19 GMT
- Country:
- Asia
- China
- Beijing > Beijing (0.04)
- Guangdong Province > Shenzhen (0.04)
- Guangxi Province > Nanning (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- China
- Europe > Switzerland
- North America > United States
- California > San Francisco County > San Francisco (0.04)
- Asia
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.67)
- Research Report
- Industry:
- Information Technology (0.67)
- Transportation > Ground
- Road (0.46)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Neural Networks > Deep Learning (0.68)
- Statistical Learning (0.69)
- Natural Language (1.00)
- Robots (0.94)
- Vision (1.00)
- Machine Learning
- Data Science > Data Mining (1.00)
- Sensing and Signal Processing > Image Processing (1.00)
- Artificial Intelligence
- Information Technology