A Distributed Hierarchical SGD Algorithm with Sparse Global Reduction