UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos
Liu, Mingxuan, He, Honglin, Ricci, Elisa, Wu, Wayne, Zhou, Bolei
–arXiv.org Artificial Intelligence
Urban embodied AI agents, ranging from delivery robots to quadrupeds, are increasingly populating our cities, navigating chaotic streets to provide last-mile connectivity. Training such agents requires diverse, high-fidelity urban environments to scale, yet existing human-crafted or procedurally generated simulation scenes either lack scalability or fail to capture real-world complexity. We introduce UrbanVerse, a data-driven real-to-sim system that converts crowd-sourced city-tour videos into physics-aware, interactive simulation scenes. UrbanVerse consists of: (i) UrbanVerse-100K, a repository of 100k+ annotated urban 3D assets with semantic and physical attributes, and (ii) UrbanVerse-Gen, an automatic pipeline that extracts scene layouts from video and instantiates metric-scale 3D simulations using retrieved assets. Running in IsaacSim, UrbanVerse offers 160 high-quality constructed scenes from 24 countries, along with a curated benchmark of 10 artist-designed test scenes. Experiments show that UrbanVerse scenes preserve real-world semantics and layouts, achieving human-evaluated realism comparable to manually crafted scenes. In urban navigation, policies trained in UrbanVerse exhibit scaling power laws and strong generalization, improving success by +6.3% in simulation and +30.1% in zero-shot sim-to-real transfer comparing to prior methods, accomplishing a 300 m real-world mission with only two interventions.
arXiv.org Artificial Intelligence
Oct-20-2025
- Country:
- Africa
- Kenya > Nairobi City County
- Nairobi (0.04)
- Middle East
- Egypt > Cairo Governorate
- Cairo (0.04)
- Morocco > Tanger-Tetouan-Al Hoceima Region
- Tangier (0.04)
- Egypt > Cairo Governorate
- Nigeria (0.04)
- South Africa > Western Cape
- Cape Town (0.04)
- Kenya > Nairobi City County
- Asia
- China > Beijing
- Beijing (0.04)
- India > NCT
- New Delhi (0.04)
- Japan > Honshū
- Kansai > Kyoto Prefecture
- Kyoto (0.04)
- Kantō > Tokyo Metropolis Prefecture
- Tokyo (0.04)
- Kansai > Kyoto Prefecture
- Kazakhstan > Almaty Region
- Almaty (0.04)
- Middle East
- Saudi Arabia > Riyadh Province
- Riyadh (0.04)
- UAE > Dubai Emirate
- Dubai (0.04)
- Saudi Arabia > Riyadh Province
- Singapore (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Vietnam > Hồ Chí Minh City
- Hồ Chí Minh City (0.04)
- China > Beijing
- Europe
- France (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- Italy (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Spain (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- North America
- Canada > Ontario
- Toronto (0.04)
- Mexico (0.04)
- United States > California
- Los Angeles County > Los Angeles (0.14)
- Canada > Ontario
- Oceania
- Australia (0.04)
- New Zealand > North Island
- Auckland Region > Auckland (0.04)
- South America
- Argentina > Pampas
- Buenos Aires F.D. > Buenos Aires (0.04)
- Brazil > Rio de Janeiro
- Rio de Janeiro (0.04)
- Colombia (0.04)
- Argentina > Pampas
- Africa
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology (0.68)
- Leisure & Entertainment > Games
- Computer Games (0.46)
- Transportation > Ground
- Road (0.67)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Natural Language > Large Language Model (0.88)
- Representation & Reasoning (1.00)
- Robots > Autonomous Vehicles (1.00)
- Vision (1.00)
- Information Technology > Artificial Intelligence