DSFormer: A Dual-Scale Cross-Learning Transformer for Visual Place Recognition
Jiang, Haiyang, Piao, Songhao, Gao, Chao, Yu, Lei, Chen, Liguo
–arXiv.org Artificial Intelligence
--Visual Place Recognition (VPR) is crucial for robust mobile robot localization, yet it faces significant challenges in maintaining reliable performance under varying environmental conditions and viewpoints. T o address this, we propose a novel framework that integrates Dual-Scale-Former (DSFormer), a Transformer-based cross-learning module, with an innovative block clustering strategy. DSFormer enhances feature representation by enabling bidirectional information transfer between dual-scale features extracted from the final two CNN layers, capturing both semantic richness and spatial details through self-attention for long-range dependencies within each scale and shared cross-attention for cross-scale learning. Complementing this, our block clustering strategy repartitions the widely used San Francisco eXtra Large (SF-XL) training dataset from multiple distinct perspectives, optimizing data organization to further bolster robustness against viewpoint variations. T ogether, these innovations not only yield a robust global embedding adaptable to environmental changes but also reduce the required training data volume by approximately 30% compared to previous partitioning methods. Comprehensive experiments demonstrate that our approach achieves state-of-the-art performance across most benchmark datasets, surpassing advanced reranking methods like DELG, Patch-NetVLAD, TransVPR, and R2Former as a global retrieval solution using 512-dim global descriptors, while significantly improving computational efficiency. PR serves as a fundamental capability in robotic systems, enabling robots to coarsely locate themselves within an environment by matching visual inputs to a pre-existing geo-tagged database, which is critical for robotics to large-scale geolocation tasks.
arXiv.org Artificial Intelligence
Jul-25-2025
- Country:
- Asia
- China
- Beijing > Beijing (0.04)
- Heilongjiang Province > Harbin (0.04)
- Hubei Province > Wuhan (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
- China
- North America > United States
- California > San Francisco County > San Francisco (0.24)
- Asia
- Genre:
- Research Report > Promising Solution (0.46)
- Technology: