The Impact of Local Geometry and Batch Size on the Convergence and Divergence of Stochastic Gradient Descent

Open in new window