Improving the convergence of SGD through adaptive batch sizes

Open in new window