Reviews: Time Matters in Regularizing Deep Networks: Weight Decay and Data Augmentation Affect Early Learning Dynamics, Matter Little Near Convergence

Neural Information Processing Systems 

The paper is well written, and the authors are clear about their claims. The idea of critical periods during training with respect to regularization is interesting; if true, it would offer a different way to think about generalization. The authors have performed a number of experiments with different configurations. However, there are deficiencies, as detailed below.