Metric-Free Natural Gradient for Joint-Training of Boltzmann Machines

Desjardins, Guillaume, Pascanu, Razvan, Courville, Aaron, Bengio, Yoshua

Mar-16-2013–arXiv.org Machine Learning

This paper introduces the Metric-Free Natural Gradient (MFNG) algorithm for training Boltzmann Machines. Similar in spirit to the Hessian-Free method of Martens [8], our algorithm belongs to the family of truncated Newton methods and exploits an efficient matrix-vector product to avoid explicitely storing the natural gradient metric $L$. This metric is shown to be the expected second derivative of the log-partition function (under the model distribution), or equivalently, the variance of the vector of partial derivatives of the energy function. We evaluate our method on the task of joint-training a 3-layer Deep Boltzmann Machine and show that MFNG does indeed have faster per-epoch convergence compared to Stochastic Maximum Likelihood with centering, though wall-clock performance is currently not competitive.

algorithm, boltzmann machine, metric-free natural gradient, (12 more...)

arXiv.org Machine Learning

Mar-16-2013

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > Colorado
    - Denver County > Denver (0.04)
  - Canada > Quebec
    - Montreal (0.04)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (0.90)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.96)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found