Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning

Dieuleveut, Aymeric, Fort, Gersende, Moulines, Eric, Wai, Hoi-To

Jul-16-2023–arXiv.org Machine Learning

Stochastic Approximation (SA) is a classical algorithm that has had since the early days a huge impact on signal processing, and nowadays on machine learning, due to the necessity to deal with a large amount of data observed with uncertainties. An exemplar special case of SA pertains to the popular stochastic (sub)gradient algorithm which is the working horse behind many important applications. A lesser-known fact is that the SA scheme also extends to non-stochastic-gradient algorithms such as compressed stochastic gradient, stochastic expectation-maximization, and a number of reinforcement learning algorithms. The aim of this article is to overview and introduce the non-stochastic-gradient perspectives of SA to the signal processing and machine learning audiences through presenting a design guideline of SA algorithms backed by theories. Our central theme is to propose a general framework that unifies existing theories of SA, including its non-asymptotic and asymptotic convergence results, and demonstrate their applications on popular non-stochastic-gradient algorithms. We build our analysis framework based on classes of Lyapunov functions that satisfy a variety of mild conditions. We draw connections between non-stochastic-gradient algorithms and scenarios when the Lyapunov function is smooth, convex, or strongly convex. Using the said framework, we illustrate the convergence properties of the non-stochastic-gradient algorithms using concrete examples. Extensions to the emerging variance reduction techniques for improved sample complexity will also be discussed.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv.org Machine Learning

Jul-16-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Netherlands > South Holland
    - Dordrecht (0.04)
  - France > Occitanie
    - Haute-Garonne > Toulouse (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - India (0.04)
  - China > Hong Kong (0.04)

Genre:
- Overview (1.00)
- Research Report > New Finding (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Mathematical & Statistical Methods (1.00)
  - Machine Learning
    - Statistical Learning > Gradient Descent (1.00)
    - Reinforcement Learning (1.00)
    - Learning Graphical Models
      - Undirected Networks > Markov Models (0.67)
      - Directed Networks > Bayesian Learning (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found