Over-Parameterization Exponentially Slows Down Gradient Descent for Learning a Single Neuron

Open in new window