Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning
Elhafsi, Amine, Morton, Daniel, Pavone, Marco
arXiv.org Artificial Intelligence
Autonomous robots must reason about the physical consequences of their actions to operate effectively in unstructured, real-world environments. We present Scan, Materialize, Simulate (SMS), a unified framework that combines 3D Gaussian Splatting for accurate scene reconstruction, visual foundation models for semantic segmentation, vision-language models for material property inference, and physics simulation for reliable prediction of action outcomes. By integrating these components, SMS enables generalizable physical reasoning and object-centric planning without the need to re-learn foundational physical dynamics. We empirically validate SMS in a billiards-inspired manipulation task and a challenging quadrotor landing scenario, demonstrating robust performance on both simulated domain transfer and real-world experiments. Our results highlight the potential of bridging differentiable rendering for scene reconstruction, foundation models for semantic understanding, and physics-based simulation to achieve physically grounded robot planning across diverse settings.
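The simulate-then-plan idea in the abstract can be illustrated with a toy sketch. It is not the authors' implementation: the material table stands in for vision-language-model inference, the 1D friction integrator stands in for a full physics simulator, and every name and number below (`FRICTION_BY_MATERIAL`, `SceneObject`, `simulate_slide`, `plan_push`, the friction coefficients) is a hypothetical placeholder chosen for illustration.

```python
from dataclasses import dataclass

# Hypothetical lookup standing in for VLM-inferred material properties.
# The friction coefficients are illustrative placeholders, not values from the paper.
FRICTION_BY_MATERIAL = {"felt": 0.4, "wood": 0.3, "ice": 0.05}

G = 9.81  # gravitational acceleration in m/s^2


@dataclass
class SceneObject:
    name: str
    material: str    # label a VLM might assign (assumed given here)
    position: float  # 1D position in metres


def simulate_slide(start: float, speed: float, mu: float, dt: float = 1e-3) -> float:
    """Integrate a puck sliding in 1D under Coulomb friction; return its rest position."""
    x, v = start, speed
    while v > 0.0:
        x += v * dt       # advance position with the current velocity
        v -= mu * G * dt  # friction decelerates the puck at mu * g
    return x


def plan_push(obj: SceneObject, target: float, candidates: list[float]) -> float:
    """Return the candidate launch speed whose simulated rest position is nearest the target."""
    mu = FRICTION_BY_MATERIAL[obj.material]
    return min(candidates, key=lambda s: abs(simulate_slide(obj.position, s, mu) - target))


puck = SceneObject(name="puck", material="felt", position=0.0)
speeds = [0.5 * k for k in range(1, 11)]  # candidate launch speeds, 0.5 to 5.0 m/s
best_speed = plan_push(puck, target=1.0, candidates=speeds)
```

The planner never learns dynamics. It only queries the simulator with properties attached to the object, which is the separation of concerns the abstract describes. A real system would replace the lookup table with inferred material properties and the integrator with a physics engine.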
May-22-2025
- Country:
  - Asia
    - Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
    - Middle East > Jordan (0.04)
  - North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Genre:
  - Research Report > New Finding (0.34)
- Industry:
  - Leisure & Entertainment > Sports (0.46)
  - Transportation (0.67)
- Technology:
  - Information Technology > Artificial Intelligence
    - Machine Learning > Neural Networks > Deep Learning (0.68)
    - Natural Language > Large Language Model (0.93)
    - Robots (1.00)
    - Vision (1.00)