On the Stability of Gradient Descent for Large Learning Rate

Open in new window