Online to Offline Conversions, Universality and Adaptive Minibatch Sizes

Kfir Levy

Neural Information Processing Systems 

Over the past years data adaptiveness has proven to be crucial to the success of learning algorithms. The objective function underlying "big data" applications often demonstrates intricate structure: the scale and smoothness are often unknown and may change substantially in between different regions/directions, [