Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Dec-23-2025, 23:31:16 GMT–Neural Information Processing Systems

Understanding the training dynamics of deep learning models is perhaps a necessary step toward demystifying the effectiveness of these models. In particular, how do training data from different classes gradually become separable in their feature spaces when training neural networks using stochastic gradient descent?

elastic stochastic differential equation, imitating deep learning dynamic, name change, (6 more...)

Neural Information Processing Systems

Dec-23-2025, 23:31:16 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks > Deep Learning (0.65)
  - Statistical Learning > Gradient Descent (0.59)