DASH: Warm-Starting Neural Network Training in Stationary Settings without Loss of Plasticity

Mar-20-2026, 10:15:43 GMT–Neural Information Processing Systems

Warm-starting neural network training by initializing networks with previously learned weights is appealing, as practical neural networks are often deployed under a continuous influx of new data. However, it often leads to, where the network loses its ability to learn new information, resulting in worse generalization than training from scratch. This occurs even under stationary data distributions, and its underlying mechanism is poorly understood. We develop a framework emulating real-world neural network training and identify noise memorization as the primary cause of plasticity loss when warm-starting on stationary data.

artificial intelligence, machine learning, proceedings, (5 more...)

Neural Information Processing Systems

Mar-20-2026, 10:15:43 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)