Mixed Optimization for Smooth Functions

Neural Information Processing Systems 

It is well known that the optimal convergence rate for stochastic optimization of smooth functions is O(1/ T), which is same as stochastic optimization of Lipschitz continuous convex functions.