Increasing Batch Size Improves Convergence of Stochastic Gradient Descent with Momentum

Open in new window