
However,while existing models cansynthesize photorealistic images, they lack an understanding of our underlying 3D world. We present a new generative model,Visual Object Networks (VON), synthesizing natural images of objects with a disentangled 3D representation.