On the convergence of gradient descent for two layer neural networks

Open in new window