Fast Convergence of Natural Gradient Descent for Over-Parameterized Neural Networks