GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model
Li, Ling, Ye, Yu, Jiang, Bingchuan, Zeng, Wei
–arXiv.org Artificial Intelligence
This work tackles the problem of geo-localization with a new paradigm using a large vision-language model (LVLM) augmented with human inference knowledge. A primary challenge here is the scarcity of data for training the LVLM - existing street-view datasets often contain numerous low-quality images lacking visual clues, and lack any reasoning inference. To address the data-quality issue, we devise a CLIP-based network to quantify the degree of street-view images being locatable, leading to the creation of a new dataset comprising highly locatable street views. To enhance reasoning inference, we integrate external knowledge obtained from real geo-localization games, tapping into valuable human inference capabilities. The data are utilized to train GeoReasoner, which undergoes fine-tuning through dedicated reasoning and location-tuning stages. Qualitative and quantitative evaluations illustrate that GeoReasoner outperforms counterpart LVLMs by more than 25% at country-level and 38% at city-level geo-localization tasks, and surpasses StreetCLIP performance while requiring fewer training resources. The data and code are available at https://github.com/lingli1996/GeoReasoner.
arXiv.org Artificial Intelligence
Jun-3-2024
- Country:
- Asia
- China
- Beijing > Beijing (0.04)
- Fujian Province > Fuzhou (0.04)
- Guangdong Province > Guangzhou (0.04)
- Hong Kong (0.04)
- Tibet Autonomous Region (0.14)
- Zhejiang Province > Hangzhou (0.04)
- India (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Malaysia > Kuala Lumpur
- Kuala Lumpur (0.04)
- Middle East > Qatar
- Singapore > Central Region
- Singapore (0.04)
- South Korea > Seoul
- Seoul (0.04)
- China
- Europe
- Austria > Vienna (0.14)
- France > Île-de-France
- Switzerland > Zürich
- Zürich (0.04)
- North America
- Canada > British Columbia
- United States
- Florida > Miami-Dade County
- Miami (0.04)
- Massachusetts (0.04)
- Nevada > Clark County
- Las Vegas (0.05)
- New York (0.05)
- Florida > Miami-Dade County
- South America > Chile (0.04)
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Transportation
- Ground > Road (0.68)
- Infrastructure & Services (0.46)
- Transportation
- Technology: