L2RSI: Cross-view LiDAR-based Place Recognition for Large-scale Urban Scenes via Remote Sensing Imagery
–Neural Information Processing Systems
We tackle the challenge of LiDAR-based place recognition, which traditionally depends on costly and time-consuming prior 3D maps. To overcome this, we first construct LiRSI-XA dataset, which encompasses approximately 110,000 remote sensing submaps and 13,000 LiDAR point cloud submaps captured in urban scenes, and propose a novel method, L2RSI, for cross-view LiDAR place recognition using high-resolution Remote Sensing Imagery. This approach enables large-scale localization capabilities at a reduced cost by leveraging readily available overhead images as map proxies. L2RSI addresses the dual challenges of cross-view and cross-modal place recognition by learning feature alignment between point cloud submaps and remote sensing submaps in the semantic domain. Additionally, we introduce a novel probability propagation method based on particle estimation to refine position predictions, effectively leveraging temporal and spatial information. This approach enables large-scale retrieval and cross-scene generalization without fine-tuning. Extensive experiments on LiRSI-XA demonstrate that, within a 100km2 retrieval range, L2RSI accurately localizes 83.27% of point cloud submaps within a 30m radius for top-1 retrieved location. Our project page is publicly available at https://shizw695.github.io/L2RSI/.
Neural Information Processing Systems
Jun-23-2026, 01:31:25 GMT
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Technology:
- Information Technology
- Sensing and Signal Processing (1.00)
- Artificial Intelligence
- Vision (1.00)
- Robots (1.00)
- Natural Language (1.00)
- Machine Learning > Neural Networks (1.00)
- Representation & Reasoning > Spatial Reasoning (0.66)
- Information Technology