Delayed Gradient Averaging: Tolerate the Communication Latency in Federated Learning

Neural Information Processing Systems 

Federated Learning is an emerging direction in distributed machine learning that enables jointly training a model without sharing the data.