Towards Explaining the Regularization Effect of Initial Large Learning Rate in Training Neural Networks
Yuanzhi Li, Colin Wei, Tengyu Ma
Neural Information Processing Systems
Feb-13-2026, 21:03:00 GMT