Fast Convergence of Natural Gradient Descent for Over-Parameterized Neural Networks

Guodong Zhang, James Martens, Roger B. Grosse

Neural Information Processing Systems 

We further extend our analysis to more general loss functions.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found