Distributed Training with Heterogeneous Data: Bridging Median-and Mean-Based Algorithms

Neural Information Processing Systems 

SGD, an algorithm recently proposed to ensure robustness against Byzantine workers.