O(1) Communication for Distributed SGD through Two-Level Gradient Averaging
