On the Algorithmic Stability and Generalization of Adaptive Optimization Methods

Nguyen, Han, Pham, Hai, Reddi, Sashank J., Póczos, Barnabás

Nov-7-2022–arXiv.org Artificial Intelligence

Despite their popularity in deep learning and machine learning in general, the theoretical properties of adaptive optimizers such as Adagrad, RMSProp, Adam or AdamW are not yet fully understood. In this paper, we develop a novel framework to study the stability and generalization of these optimization methods. Based on this framework, we show provable guarantees about such properties that depend heavily on a single parameter $\beta_2$. Our empirical experiments support our claims and provide practical insights into the stability and generalization properties of adaptive optimization methods.

artificial intelligence, generalization, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Nov-7-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)
- Europe
  - Spain > Andalusia
    - Cádiz Province > Cadiz (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (1.00)
  - Machine Learning
    - Statistical Learning (0.94)
    - Neural Networks > Deep Learning (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found