SAM 3D: 3Dfy Anything in Images
SAM 3D Team, null, Chen, Xingyu, Chu, Fu-Jen, Gleize, Pierre, Liang, Kevin J, Sax, Alexander, Tang, Hao, Wang, Weiyao, Guo, Michelle, Hardin, Thibaut, Li, Xiang, Lin, Aohan, Liu, Jiawei, Ma, Ziqi, Sagar, Anushka, Song, Bowen, Wang, Xiaodong, Yang, Jianing, Zhang, Bowen, Dollár, Piotr, Gkioxari, Georgia, Feiszli, Matt, Malik, Jitendra
–arXiv.org Artificial Intelligence
We present SAM 3D, a generative model for visually grounded 3D object reconstruction, predicting geometry, texture, and layout from a single image. SAM 3D excels in natural images, where occlusion and scene clutter are common and visual recognition cues from context play a larger role. We achieve this with a human- and model-in-the-loop pipeline for annotating object shape, texture, and pose, providing visually grounded 3D reconstruction data at unprecedented scale. We learn from this data in a modern, multi-stage training framework that combines synthetic pretraining with real-world alignment, breaking the 3D "data barrier". We obtain significant gains over recent work, with at least a 5:1 win rate in human preference tests on real-world objects and scenes. We will release our code and model weights, an online demo, and a new challenging benchmark for in-the-wild 3D object reconstruction.
arXiv.org Artificial Intelligence
Nov-21-2025
- Country:
- Asia
- China > Yunnan Province
- Kunming (0.04)
- Middle East > Jordan (0.04)
- China > Yunnan Province
- North America > United States
- Massachusetts (0.04)
- New York > New York County
- New York City (0.04)
- South America > Brazil (0.04)
- Asia
- Genre:
- Research Report (0.50)
- Industry:
- Education (0.45)
- Leisure & Entertainment (0.45)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Natural Language (1.00)
- Representation & Reasoning (1.00)
- Robots (1.00)
- Vision (1.00)
- Machine Learning > Neural Networks
- Sensing and Signal Processing > Image Processing (1.00)
- Artificial Intelligence
- Information Technology