Subnational Geocoding of Global Disasters Using Large Language Models
Ronco, Michele, Delforge, Damien, Jäger, Wiebke S., Corbane, Christina
–arXiv.org Artificial Intelligence
Subnational location data of disaster events are critical for risk assessment and disaster risk reduction. Disaster databases such as EM-DAT often report locations in unstructured textual form, with inconsistent granularity or spelling, that make it difficult to integrate with spatial datasets. We present a fully automated LLM-assisted workflow that processes and cleans textual location information using GPT-4o, and assigns geometries by cross-checking three independent geoinformation repositories: GADM, OpenStreetMap and Wikidata. Based on the agreement and availability of these sources, we assign a reliability score to each location while generating subnational geometries. Applied to the EM-DAT dataset from 2000 to 2024, the workflow geocodes 14,215 events across 17,948 unique locations. Unlike previous methods, our approach requires no manual intervention, covers all disaster types, enables cross-verification across multiple sources, and allows flexible remapping to preferred frameworks. Beyond the dataset, we demonstrate the potential of LLMs to extract and structure geographic information from unstructured text, offering a scalable and reliable method for related analyses.
arXiv.org Artificial Intelligence
Nov-20-2025
- Country:
- Africa (0.04)
- Asia
- Bangladesh > Dhaka Division
- Dhaka District > Dhaka (0.04)
- China > Chongqing Province
- Chongqing (0.04)
- India
- Maharashtra (0.04)
- Uttar Pradesh (0.04)
- Japan > Honshū
- Tōhoku > Miyagi Prefecture > Sendai (0.05)
- Pakistan > Sindh (0.04)
- Philippines (0.04)
- Bangladesh > Dhaka Division
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Italy (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Belgium > Brussels-Capital Region
- North America > United States
- Oceania (0.04)
- Genre:
- Research Report (0.64)
- Workflow (0.88)
- Industry:
- Information Technology (0.34)
- Technology: