The Natural Gradient by Analogy to Signal Whitening, and Recipes and Tricks for its Use
The natural gradient, as introduced by [Amari, 1987], allows for more efficient gradient descent by removing dependencies and biases inherent in a function's parameterization. Several papers present the topic thoroughly and precisely [Amari, 1987, Amari, 1998, Amari and Nagaoka, 2000, Theis, 2005, Amari, 2010]. It remains a very difficult idea to get your head around however. The intent of this note is to provide simple intuition for the natural gradient and its uses. We review how an ill conditioned parameter space can undermine learning, introduce the natural gradient by analogy to the more widely understood concept of signal whitening, and present tricks and specific prescriptions for applying the natural gradient to learning problems.
May-8-2012
- Country:
- North America > United States
- California (0.04)
- Asia
- Middle East > Jordan (0.05)
- China (0.04)
- North America > United States
- Genre:
- Research Report (0.50)
- Technology: