SuperVLAD: Compact and Robust Image Descriptors for Visual Place Recognition Feng Lu1,2 Canming Ye1 Shuting Dong

May-28-2025, 09:58:11 GMT–Neural Information Processing Systems

Visual place recognition (VPR) is an essential task for multiple applications such as augmented reality and robot localization. Over the past decade, mainstream methods in the VPR area have been to use feature representation based on global aggregation, as exemplified by NetVLAD. These features are suitable for largescale VPR and robust against viewpoint changes. However, the VLAD-based aggregation methods usually learn a large number of (e.g., 64) clusters and their corresponding cluster centers, which directly leads to a high dimension of the yielded global features. More importantly, when there is a domain gap between the data in training and inference, the cluster centers determined on the training set are usually improper for inference, resulting in a performance drop.

data mining, machine learning, supervlad, (21 more...)

Neural Information Processing Systems

May-28-2025, 09:58:11 GMT

Conferences PDF

Add feedback

Country:
- Asia > China (0.46)
- Europe > Switzerland
  - Zürich > Zürich (0.14)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.67)

Industry:
- Information Technology (0.67)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning
      - Neural Networks > Deep Learning (0.68)
      - Statistical Learning (0.69)
    - Natural Language (1.00)
    - Robots (1.00)
    - Vision (1.00)
  - Data Science > Data Mining (1.00)
  - Sensing and Signal Processing > Image Processing (1.00)