Goto

Collaborating Authors

 ladder variational autoencoder


Ladder Variational Autoencoders

Neural Information Processing Systems

Variational autoencoders are powerful models for unsupervised learning. However deep models with several layers of dependent stochastic variables are difficult to train which limits the improvements obtained using these highly expressive models. We propose a new inference model, the Ladder Variational Autoencoder, that recursively corrects the generative distribution by a data dependent approximate likelihood in a process resembling the recently proposed Ladder Network. We show that this model provides state of the art predictive log-likelihood and tighter log-likelihood lower bound compared to the purely bottom-up inference in layered Variational Autoencoders and other generative models. We provide a detailed analysis of the learned hierarchical latent representation and show that our new inference model is qualitatively different and utilizes a deeper more distributed hierarchy of latent variables. Finally, we observe that batch-normalization and deterministic warm-up (gradually turning on the KL-term) are crucial for training variational models with many stochastic layers.


Reviews: Ladder Variational Autoencoders

Neural Information Processing Systems

The paper is nicely written and carefully details its model and experiments. The intuition behind the combination of a bottom-up, top-down inference is interesting, though I believe the authors overstate their point when they say VAE can'only' do pure bottom-up inference. From a computational graph prospective, what the authors propose in fine is a complex network which samples the latent variables top-down, with a fair amount of skip connections (the lateral connections), hidden units with the semantic of mean and standard deviation, shared parameters between the generative model and the inference network, and some parameterless, differentiable layers (the'combination' layers of equation 18-19). It has a very nice interpretation as the authors suggest. But computationally, nothing prevents a complex inference network with top-down sampling (compatible with classical VAE framework) to carry'bottom up' in its information in its hidden units and effectively perform similar computation.


Ladder Variational Autoencoders

Neural Information Processing Systems

Variational autoencoders are powerful models for unsupervised learning. However deep models with several layers of dependent stochastic variables are difficult to train which limits the improvements obtained using these highly expressive models. We propose a new inference model, the Ladder Variational Autoencoder, that recursively corrects the generative distribution by a data dependent approximate likelihood in a process resembling the recently proposed Ladder Network. We show that this model provides state of the art predictive log-likelihood and tighter log-likelihood lower bound compared to the purely bottom-up inference in layered Variational Autoencoders and other generative models. We provide a detailed analysis of the learned hierarchical latent representation and show that our new inference model is qualitatively different and utilizes a deeper more distributed hierarchy of latent variables.


Code used Ladder Variational Autoencoders - clarification • /r/MachineLearning

@machinelearnbot

Hi, so I actually subimted it as an issue to the github repo, but maybe someone here can clarify some of the things if I do not understand them correctly, as I do not know if the authors would reply any time soon.


Ladder Variational Autoencoders

arXiv.org Machine Learning

Variational Autoencoders are powerful models for unsupervised learning. However deep models with several layers of dependent stochastic variables are difficult to train which limits the improvements obtained using these highly expressive models. We propose a new inference model, the Ladder Variational Autoencoder, that recursively corrects the generative distribution by a data dependent approximate likelihood in a process resembling the recently proposed Ladder Network. We show that this model provides state of the art predictive log-likelihood and tighter log-likelihood lower bound compared to the purely bottom-up inference in layered Variational Autoencoders and other generative models. We provide a detailed analysis of the learned hierarchical latent representation and show that our new inference model is qualitatively different and utilizes a deeper more distributed hierarchy of latent variables. Finally, we observe that batch normalization and deterministic warm-up (gradually turning on the KL-term) are crucial for training variational models with many stochastic layers.