Learning from time-dependent streaming data with online stochastic algorithms
Godichon-Baggioni, Antoine, Werge, Nicklas, Wintenberger, Olivier
–arXiv.org Artificial Intelligence
This paper addresses stochastic optimization in a streaming setting with time-dependent and biased gradient estimates. We analyze several first-order methods, including Stochastic Gradient Descent (SGD), mini-batch SGD, and time-varying mini-batch SGD, along with their Polyak-Ruppert averages. Our non-asymptotic analysis establishes novel heuristics that link dependence, biases, and convexity levels, enabling accelerated convergence. Specifically, our findings demonstrate that (i) time-varying mini-batch SGD methods have the capability to break long- and short-range dependence structures, (ii) biased SGD methods can achieve comparable performance to their unbiased counterparts, and (iii) incorporating Polyak-Ruppert averaging can accelerate the convergence of the stochastic optimization algorithms. To validate our theoretical findings, we conduct a series of experiments using both simulated and real-life time-dependent data.
arXiv.org Artificial Intelligence
Jul-18-2023
- Country:
- Asia > Russia (0.04)
- North America > United States
- New York (0.04)
- Europe
- Russia (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Switzerland > Neuchâtel
- Neuchâtel (0.04)
- Genre:
- Research Report > New Finding (0.86)
- Technology: