TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning

Wei Wen, Cong Xu, Feng Yan, Chunpeng Wu, Yandan Wang, Yiran Chen, Hai Li

Neural Information Processing Systems 

High network communication cost for synchronizing gradients and parameters is the well-known bottleneck of distributed training.
