Stochastic Gradient Descent on Separable Data: Exact Convergence with a Fixed Learning Rate
