On the Convergence Rate of the Stochastic Gradient Descent (SGD) and application to a modified policy gradient for the Multi Armed Bandit

Feb-9-2024–arXiv.org Artificial Intelligence

Stochastic Gradient Descent (SGD) is extensively used in Deep Learning (see [2]). A direct and simple proof of a convergence result for SGD is given in [3]. Here we go further and investigate the convergence rate of SGD, giving elementary proofs for two choices of the "learning rate", both in the class of 'inverse time decay' schedules.
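
To make the term 'inverse time decay' concrete, here is a minimal Python sketch of SGD with such a schedule. The abstract does not state the two schedules the paper analyses, so the form eta_n = initial_rate / (1 + decay * n), the quadratic objective, the Gaussian gradient noise, and all function and parameter names are illustrative assumptions rather than the paper's setup.

```python
import random


def inverse_time_decay(initial_rate, decay, step):
    """Assumed schedule: eta_n = initial_rate / (1 + decay * n)."""
    return initial_rate / (1.0 + decay * step)


def sgd_quadratic(initial_rate, decay, steps=20000, seed=0):
    """Run SGD on the toy objective f(x) = x^2 / 2 (minimum at x = 0).

    The exact gradient is x; each step sees it corrupted by unit-variance
    Gaussian noise, a hypothetical stand-in for minibatch noise.
    """
    rng = random.Random(seed)
    x = 5.0  # arbitrary starting point
    for n in range(steps):
        noisy_grad = x + rng.gauss(0.0, 1.0)
        x -= inverse_time_decay(initial_rate, decay, n) * noisy_grad
    return x


# Two members of the inverse-time-decay family, differing in decay speed.
final_fast = sgd_quadratic(initial_rate=1.0, decay=1.0)
final_slow = sgd_quadratic(initial_rate=1.0, decay=0.1)
print(abs(final_fast), abs(final_slow))
```

With initial_rate = 1 and decay = 1 the step size is 1/(n+1), so after the first step the iterate equals minus the running average of the noise samples. This makes the slow shrinkage of the error toward the minimum easy to see on this toy problem.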

convergence rate, hypothesis, stochastic gradient descent

Country:
- North America > United States
  - Massachusetts > Middlesex County > Cambridge (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)