The Implicit Bias of Adam on Separable Data

Neural Information Processing Systems 

Adam has become one of the most widely used optimizers in deep learning.