Review for NeurIPS paper: On the Almost Sure Convergence of Stochastic Gradient Descent in Non-Convex Problems

Jan-21-2025, 11:44:08 GMT–Neural Information Processing Systems

Weaknesses: There are a lot similar results in slightly different regime, which makes this work looks incremental. In the case of GD, this Morse assumption can be resolved by using a stronger stable manifold theorem in "Michael Shub. I suspect a similar combination might go through here? Usually one view asymptotic results (this paper) weaker than non-asymptotic results (earlier papers), it is also not clear from this paper if one can obtain probability 1 result by modifying the existing high probability result with Borel Cantelli lemma and a bit extra work.

neurips paper, non-convex problem, stochastic gradient descent, (3 more...)

Neural Information Processing Systems

Jan-21-2025, 11:44:08 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.85)