Asynchronous Stochastic Optimization Robust to Arbitrary Delays

Neural Information Processing Systems 

While mini-batching is well understood theoretically [e.g.,