Communication-efficient Distributed SGD with Sketching

Neural Information Processing Systems 

Large-scale distributed training of neural networks is often limited by network bandwidth, wherein the communication time overwhelms the local computation time.
