Deep Convolutional Inverse Graphics Network
Tejas D. Kulkarni, William F. Whitney, Pushmeet Kohli, Josh Tenenbaum
–Neural Information Processing Systems
This paper presents the Deep Convolution Inverse Graphics Network (DC-IGN), a model that aims to learn an interpretable representation of images, disentangled with respect to three-dimensional scene structure and viewing transformations such as depth rotations and lighting variations. The DC-IGN model is composed of multiple layers of convolution and de-convolution operators and is trained using the Stochastic Gradient Variational Bayes (SGVB) algorithm [10]. We propose a training procedure to encourage neurons in the graphics code layer to represent a specific transformation (e.g.
Neural Information Processing Systems
Feb-8-2025, 02:32:20 GMT