
Stochastic Gradient Descent (SGD) has been the method of choice for learning large-scale non-convex models. While a general analysis of when SGD works has remained elusive, there has been substantial recent progress in understanding the convergence of Gradient Flow (GF) on the population loss, partly due to the simplicity that a continuous-time analysis affords.
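
For concreteness, the two dynamics contrasted above can be written explicitly; the notation below is a standard sketch rather than this paper's own, with $L$ the population loss, $\ell$ a per-sample loss, $\eta$ the step size, and $\xi_k$ an i.i.d. sample:
$$
\theta_{k+1} = \theta_k - \eta \, \nabla_\theta \ell(\theta_k; \xi_k) \quad \text{(SGD)},
\qquad
\frac{d\theta(t)}{dt} = -\nabla_\theta L\big(\theta(t)\big) \quad \text{(GF)}.
$$
GF is the $\eta \to 0$ limit of full-batch gradient descent on $L$, which removes both the discretization and the sampling noise and is what makes continuous-time (ODE) tools applicable to its analysis.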