Adaptive Gradient Quantization for Data-Parallel SGD
