Exponential Concentration of Stochastic Approximation with Non-vanishing Gradient

Aug-3-2023–arXiv.org Artificial Intelligence

We consider stochastic approximation algorithms where the expected progress toward the optimum is proportional to the algorithm's step size. For instance, a stochastic gradient descent algorithm applied to a convex function will satisfy this property when bounded away from the optimum. This property can continue to hold when the optimum is on the corner of a convex constraint set or when the function is not smooth at the optimum or when the objective function lies within a cone. In such settings, we will show that a projected stochastic gradient descent algorithm can have a different rate of convergence than would be anticipated by standard results for stochastic gradient descent with a smooth objective and smooth constraints. We develop new results whose methods are typically used in probability to analyze random walks or in applied probability to analyze queueing networks. For stochastic approximation, our results establish new exponential concentration bounds. We now summarize the background and problems where our results apply.

algorithm, artificial intelligence, machine learning, (15 more...)

arXiv.org Artificial Intelligence

Aug-3-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania > Philadelphia County
    - Philadelphia (0.04)
  - New York > New York County
    - New York City (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
    - Belmont (0.04)
- Europe > United Kingdom
  - England
    - Greater London > London (0.04)
    - Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report > New Finding (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (0.94)
  - Machine Learning > Statistical Learning
    - Gradient Descent (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found