Stochastic Gradient Descent with Large Learning Rate

Open in new window