SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting
Park, Sung-Yeon, Lee, Adam, Lu, Juanwu, Cui, Can, Jiang, Luyang, Gupta, Rohit, Han, Kyungtae, Moradipari, Ahmadreza, Wang, Ziran
–arXiv.org Artificial Intelligence
Driving scene manipulation with sensor data is emerging as a promising alternative to traditional virtual driving simulators. However, existing frameworks struggle to generate realistic scenarios efficiently due to limited editing capabilities. To address these challenges, we present SIMSplat, a predictive driving scene editor with language-aligned Gaussian splatting. As a language-controlled editor, SIMSplat enables intuitive manipulation using natural language prompts. By aligning language with Gaussian-reconstructed scenes, it further supports direct querying of road objects, allowing precise and flexible editing. Our method provides detailed object-level editing, including adding new objects and modifying the trajectories of both vehicles and pedestrians, while also incorporating predictive path refinement through multi-agent motion prediction to generate realistic interactions among all agents in the scene. Experiments on the Waymo dataset demonstrate SIMSplat's extensive editing capabilities and adaptability across a wide range of scenarios. Project page: https://sungyeonparkk.github.io/simsplat/
arXiv.org Artificial Intelligence
Oct-6-2025
- Genre:
- Research Report (0.64)
- Industry:
- Transportation > Ground > Road (0.69)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Natural Language (1.00)
- Machine Learning (1.00)
- Representation & Reasoning > Agents (0.89)
- Information Technology > Artificial Intelligence