On the Convergence Rate of the Stochastic Gradient Descent (SGD) and application to a modified policy gradient for the Multi Armed Bandit
Anita, Stefana, Turinici, Gabriel
–arXiv.org Artificial Intelligence
The Stochastic Gradient Descent (SGD) is extensively used in Deep Learning (see [2]). A direct and simple proof of a convergence result of the SGD is given in [3]. Here we will go further and investigate the convergence rate of the SGD, giving some elementary proofs, for two choices of the "learning rate", both in the class of'inverse time decay' schedules.
arXiv.org Artificial Intelligence
Feb-9-2024
- Country:
- Genre:
- Research Report (0.40)
- Technology: