Deep Convolutional Inverse Graphics Network

Tejas D. Kulkarni, William F. Whitney, Pushmeet Kohli, Josh Tenenbaum

Feb-8-2025, 02:32:20 GMT–Neural Information Processing Systems

This paper presents the Deep Convolution Inverse Graphics Network (DC-IGN), a model that aims to learn an interpretable representation of images, disentangled with respect to three-dimensional scene structure and viewing transformations such as depth rotations and lighting variations. The DC-IGN model is composed of multiple layers of convolution and de-convolution operators and is trained using the Stochastic Gradient Variational Bayes (SGVB) algorithm [10]. We propose a training procedure to encourage neurons in the graphics code layer to represent a specific transformation (e.g.

artificial intelligence, machine learning, representation, (16 more...)

Neural Information Processing Systems

Feb-8-2025, 02:32:20 GMT

Conferences PDF

Add feedback