AdaScale SGD: A User-Friendly Algorithm for Distributed Training

Open in new window