Adaptive Variance Reduction for Stochastic Optimization under Weaker Assumptions
Wei Jiang, Sifan Yang
–Neural Information Processing Systems
Problem (1) has been comprehensively investigated in the literature [Duchi et al., 2011, Kingma and Ba, 2015, Loshchilov and Hutter, 2017], and it is well known that classical stochastic gradient descent (SGD) achieves a convergence rate of
Feb-9-2026, 18:17:05 GMT
- Country:
  - Asia > China > Jiangsu Province > Nanjing (0.04)
- Genre:
  - Research Report > Experimental Study (0.93)
- Industry:
  - Education (0.46)
  - Information Technology (0.67)
- Technology: