Communication trade-offs for synchronized distributed SGD with large step size
