Fast Convergence of Natural Gradient Descent for Overparameterized Neural Networks

Open in new window