Gaussian-Bernoulli RBMs Without Tears

Renjie Liao, Simon Kornblith, Mengye Ren, David J. Fleet, Geoffrey Hinton

Oct-19-2022, arXiv.org Artificial Intelligence

We revisit the challenging problem of training Gaussian-Bernoulli restricted Boltzmann machines (GRBMs), introducing two innovations. First, we propose a novel Gibbs-Langevin sampling algorithm that outperforms existing methods like Gibbs sampling. Second, we propose a modified contrastive divergence (CD) algorithm so that one can generate images with GRBMs starting from noise. This enables direct comparison of GRBMs with deep generative models, improving evaluation protocols in the RBM literature. Moreover, we show that modified CD and gradient clipping are enough to robustly train GRBMs with large learning rates, thus removing the necessity of various tricks in the literature. Experiments on Gaussian Mixtures, MNIST, FashionMNIST, and CelebA show that GRBMs can generate good samples despite their single-hidden-layer architecture. Our code is released at: https://github.com/lrjconan/GRBM
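
For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of plain block Gibbs sampling in a GRBM, started from noise, plus a norm-based gradient clipping helper. It is not the authors' code and it is not the Gibbs-Langevin sampler or the modified CD algorithm. It assumes one common energy parameterization, E(v, h) = 1/2 * sum_i ((v_i - mu_i) / sigma_i)^2 - sum_ij (v_i / sigma_i^2) W_ij h_j - sum_j b_j h_j, which may differ from the one used in the paper. All function and parameter names (`sample_hidden`, `sample_visible`, `gibbs_chain`, `clip_by_norm`, `W`, `b`, `mu`, `sigma`) are hypothetical, and clipping by global norm is an assumption, since the abstract does not say which form of clipping is used.

```python
import math
import random


def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def sample_hidden(v, W, b, sigma, rng):
    # Under the assumed energy: p(h_j = 1 | v) = sigmoid(b_j + sum_i W[i][j] * v_i / sigma_i^2).
    probs = []
    for j in range(len(b)):
        act = b[j] + sum(W[i][j] * v[i] / sigma[i] ** 2 for i in range(len(v)))
        probs.append(sigmoid(act))
    h = [1 if rng.random() < p else 0 for p in probs]
    return h, probs


def sample_visible(h, W, mu, sigma, rng):
    # Under the assumed energy: p(v_i | h) = Normal(mu_i + sum_j W[i][j] * h_j, sigma_i^2).
    means = [mu[i] + sum(W[i][j] * h[j] for j in range(len(h)))
             for i in range(len(mu))]
    v = [rng.gauss(m, s) for m, s in zip(means, sigma)]
    return v, means


def gibbs_chain(W, b, mu, sigma, steps, seed=0):
    # Start the chain from standard Gaussian noise, then alternate the two conditionals.
    rng = random.Random(seed)
    v = [rng.gauss(0.0, 1.0) for _ in mu]
    for _ in range(steps):
        h, _ = sample_hidden(v, W, b, sigma, rng)
        v, _ = sample_visible(h, W, mu, sigma, rng)
    return v


def clip_by_norm(grad, max_norm):
    # Rescale a flat gradient vector so its Euclidean norm is at most max_norm.
    norm = math.sqrt(sum(g * g for g in grad))
    if norm <= max_norm or norm == 0.0:
        return list(grad)
    scale = max_norm / norm
    return [g * scale for g in grad]


if __name__ == "__main__":
    # Toy model: 2 visible units, 2 hidden units (values chosen arbitrarily).
    W = [[1.0, -1.0], [0.5, 0.5]]
    b = [0.0, 0.0]
    mu = [0.0, 0.0]
    sigma = [1.0, 1.0]
    print(gibbs_chain(W, b, mu, sigma, steps=10))
    print(clip_by_norm([3.0, 4.0], max_norm=1.0))
```

In this sketch the visible units are sampled jointly given the hidden units and vice versa, which is what makes it block Gibbs sampling. A training loop would apply `clip_by_norm` to the parameter gradient before each update.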

artificial intelligence, grbm, machine learning
