Fast Approximate Natural Gradient Descent in a Kronecker Factored Eigenbasis

Thomas George, César Laurent, Xavier Bouthillier, Nicolas Ballas, Pascal Vincent

Neural Information Processing Systems 

Stochastic Gradient Descent (SGD) and its variants are the current workhorse for training neural networks.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found