Chaotic Dynamics are Intrinsic to Neural Network Training with SGD
–Neural Information Processing Systems
With the advent of deep learning over the last decade, a considerable amount of effort has gone into better understanding and enhancing Stochastic Gradient Descent so as to improve the performance and stability of artificial neural network training.
Neural Information Processing Systems
Nov-13-2025, 14:31:16 GMT