Wormholes Improve Contrastive Divergence
Welling, Max, Mnih, Andriy, Hinton, Geoffrey E.
–Neural Information Processing Systems
In models that define probabilities via energies, maximum likelihood learning typically involves using Markov Chain Monte Carlo to sample from the model's distribution.
Neural Information Processing Systems
Dec-31-2004