Reviews: GumBolt: Extending Gumbel trick to Boltzmann priors
–Neural Information Processing Systems
In particular it extends the dVAE (Rolfe, 2016) and dVAE (Vahdat et al., 2018) models which use a Boltzmann machine (BM) prior on the discrete latent variables by using an analogue of the'Gumbel trick' relaxation (Maddison et al., 2016; Jang et al., 2016) applied to the BM prior. The resulting model and training approach is argued to be implementationally simpler than the dVAE and dVAE approaches while also allowing the use of a tighter importance-weighted variational bound (Burda et al., 2015) which has been found to often improve training performance. The authors empirically demonstrate the efficacy of their proposed'GumBolt' approach compared to dVAE and dVAE in terms of significant improvements in test set log likelihoods on two benchmark binarized image generative model datasets (MNIST and OMNIGLOT) across a range of different architectures.
Neural Information Processing Systems
Oct-7-2024, 21:46:19 GMT
- Technology: