Sparsification as a Remedy for Staleness in Distributed Asynchronous SGD
