QSGD: Communication-Efficient SGD via Gradient Quantization and Encoding

Dan Alistarh, Demjan Grubic, Jerry Li, Ryota Tomioka, Milan Vojnovic

Neural Information Processing Systems 

Parallel implementations of stochastic gradient descent (SGD) have received significant research attention, thanks to their excellent scalability properties.
