Achieving the fundamental convergence-communication tradeoff with Differentially Quantized Gradient Descent

Open in new window