Towards Explaining the Regularization Effect of Initial Large Learning Rate in Training Neural Networks
Yuanzhi Li, Colin Wei, Tengyu Ma
Neural Information Processing Systems
Aug-20-2025, 00:46:33 GMT