Map It Anywhere (MIA): Empowering Bird's Eye View Mapping using Large-scale Public Data Cherie Ho1 Omar Alama 1
–Neural Information Processing Systems
Top-down Bird's Eye View (BEV) maps are a popular perceptual representation for ground robot navigation due to their richness and flexibility for downstream tasks. While recent methods have shown promise for predicting BEV maps from First-Person View (FPV) images, their generalizability is limited to small regions captured by current autonomous vehicle-based datasets. In this context, we show that a more scalable approach towards generalizable map prediction can be enabled by using two large-scale crowd-sourced mapping platforms, Mapillary for FPV images and OpenStreetMap for BEV semantic maps. We introduce Map It Anywhere (MIA), a data engine that enables seamless curation and modeling of labeled map prediction data from existing open-source map platforms. Using our MIA data engine, we display the ease of automatically collecting a dataset of 1.2 million pairs of FPV images & BEV maps encompassing diverse geographies, landscapes, environmental factors, camera models & capture scenarios.
Neural Information Processing Systems
May-30-2025, 02:43:03 GMT
- Country:
- North America > United States (0.94)
- Genre:
- Research Report (0.93)
- Industry:
- Information Technology > Security & Privacy (0.67)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning (1.00)
- Representation & Reasoning (1.00)
- Robots > Autonomous Vehicles (0.49)
- Vision (1.00)
- Communications > Social Media
- Crowdsourcing (0.35)
- Artificial Intelligence
- Information Technology