Reviews: Painless Stochastic Gradient: Interpolation, Line-Search, and Convergence Rates

Neural Information Processing Systems 

UPDATE: I've read the other reviews and the rebuttal. I am keeping my score - this is a good paper.

The study of stochastic gradient descent in the overparameterized setting is a popular and important trend in the recent development of huge-scale optimization for deep learning. The authors propose a very basic and classical method, built from well-known algorithmic blocks (SGD with an Armijo-type line search), together with its first theoretical justification under the "interpolation assumption". The proof of convergence (for example, Theorem 2) consists mainly of standard arguments, the same ones used to prove convergence of the classical non-stochastic gradient method under Lipschitz-continuous gradients.
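
For readers unfamiliar with the method being reviewed, here is a minimal sketch of SGD with a stochastic Armijo-type line search on an interpolating least-squares problem. The problem setup, the constants (eta_max, beta, c), and the function names are my own illustrative assumptions, not the authors' exact formulation.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 100, 20
A = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
b = A @ w_true                      # interpolation: some w fits every sample exactly

def f_i(w, i):                      # per-sample loss
    return 0.5 * (A[i] @ w - b[i]) ** 2

def grad_i(w, i):                   # per-sample gradient
    return (A[i] @ w - b[i]) * A[i]

def armijo_step(w, i, eta_max=10.0, beta=0.5, c=0.5):
    """Backtrack on the sampled loss f_i until the Armijo condition holds."""
    g = grad_i(w, i)
    eta = eta_max
    while f_i(w - eta * g, i) > f_i(w, i) - c * eta * g @ g:
        eta *= beta
    return w - eta * g

w = np.zeros(d)
for t in range(2000):
    i = rng.integers(n)             # sample one data point, take a line-search step on it
    w = armijo_step(w, i)

print("final mean loss:", np.mean([f_i(w, i) for i in range(n)]))
```

The key point the paper exploits is that, under interpolation, the per-sample Armijo condition still yields step sizes large enough for fast convergence without any decaying step-size schedule.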