Visual Object Networks: Image Generation with Disentangled 3D Representations