Note on Learning Rate Schedules for Stochastic Optimization

Neural Information Processing Systems 

G is the average of an objective function over the exemplars, labeled E and X respectively.