Adaptive Variance Reduction for Stochastic Optimization under Weaker Assumptions Wei Jiang 1

Neural Information Processing Systems 

This paper explores adaptive variance reduction methods for stochastic optimization based on the STORM technique. Existing adaptive extensions of STORM rely on strong assumptions like bounded gradients and bounded function values, or suffer an additional O(log T) term in the convergence rate.