Adam with Bandit Sampling for Deep Learning

Neural Information Processing Systems 

Adam is a widely used optimization method for training deep learning models.