Learning to Infer 3D Object Models from Images
Chen, Chang; Deng, Fei; Ahn, Sungjin
A crucial ability of human intelligence is to build up models of individual 3D objects from partial scene observations. Recent works achieve object-centric generation but without the ability to infer the representation, or achieve 3D scene representation learning but without object-centric compositionality. Therefore, learning to represent and render 3D scenes with object-centric compositionality remains elusive. In this paper, we propose a probabilistic generative model for learning to build modular and compositional 3D object models from partial observations of a multi-object scene. The proposed model can (i) infer the 3D object representations by learning to search and group object areas and also (ii) render from an arbitrary viewpoint not only individual objects but also the full scene by compositing the objects. The entire learning process is unsupervised and end-to-end. In experiments, in addition to generation quality, we also demonstrate that the learned representation permits object-wise manipulation and novel scene generation, and generalizes to various settings. Results can be found on our project website: https://sites.google.com/view/roots3d
Jun-10-2020
- Country:
  - North America > United States
    - New Jersey > Middlesex County > Piscataway (0.04)
  - Europe
    - United Kingdom > England
      - Oxfordshire > Oxford (0.04)
      - Cambridgeshire > Cambridge (0.04)
    - Netherlands > North Holland
      - Amsterdam (0.04)
  - Asia > Japan
    - Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
- Genre:
  - Research Report (0.50)
- Industry:
  - Health & Medicine > Therapeutic Area > Neurology (0.67)
- Technology:
  - Information Technology > Artificial Intelligence
    - Vision (1.00)
    - Representation & Reasoning (1.00)
    - Cognitive Science (0.87)
    - Machine Learning
      - Neural Networks > Deep Learning (1.00)
      - Statistical Learning (0.68)