A Better Way to Pretrain Deep Boltzmann Machines

Apr-6-2023, 12:26:19 GMT–Neural Information Processing Systems

We describe how the pre-training algorithm for Deep Boltzmann Machines (DBMs) is related to the pre-training algorithm for Deep Belief Networks and we show that under certain conditions, the pre-training procedure improves the variational lower bound of a two-hidden-layer DBM. Based on this analysis, we develop a different method of pre-training DBMs that distributes the modelling work more evenly over the hidden layers. Our results on the MNIST and NORB datasets demonstrate that the new pre-training algorithm allows us to learn better generative models.

deep boltzmann machine, pre-training algorithm, pretrain deep boltzmann machine, (1 more...)

Neural Information Processing Systems

Apr-6-2023, 12:26:19 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks > Deep Learning (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.69)