Convergence of continuous-time stochastic gradient descent with applications to linear deep neural networks

Sep-11-2024–arXiv.org Machine Learning

We study a continuous-time approximation of the stochastic gradient descent process for minimizing the expected loss in learning problems. The main results establish general sufficient conditions for the convergence, extending the results of Chatterjee (2022) established for (nonstochastic) gradient descent. We show how the main result can be applied to the case of overparametrized linear neural network training.

gradient descent, neural network, stochastic gradient descent, (13 more...)

arXiv.org Machine Learning

Sep-11-2024

arXiv.org PDF

Add feedback

Country:
- Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre:
- Research Report (0.82)

Industry:
- Education (0.48)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning > Gradient Descent (1.00)
  - Neural Networks (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found