Reviews: On Lazy Training in Differentiable Programming

Neural Information Processing Systems 

The paper provides some interesting understanding, but it is not significant enough to explain the issues of real interest in deep learning. The paper shows that lazy training can be induced by parameter scaling and is therefore not special to overparameterized neural networks. What does this tell us about overparameterized neural networks? Does the result imply that the lazy regime of overparameterized neural networks is necessarily due to parameter scaling? If not, then the lazy regime of overparameterized neural networks cannot be explained by parameter scaling alone.
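The scaling mechanism under discussion can be checked numerically: rescale a centered model's output by a factor alpha and the loss by 1/alpha^2, and the parameters barely move during training even though the fit improves at the same rate. A minimal sketch of this effect, assuming a toy two-layer tanh network and illustrative data and hyperparameters (none of which are taken from the paper):

```python
import numpy as np

# Toy regression data: scalar inputs, smooth target (illustrative choice).
n = 20
x = np.linspace(-1.0, 1.0, n)
y = np.sin(3.0 * x)

m = 50  # hidden width


def init_params():
    rng = np.random.default_rng(1)
    return rng.standard_normal(m), rng.standard_normal(m)  # W, a


def forward(W, a, x):
    # f(x) = a . tanh(W x) / sqrt(m)
    H = np.tanh(np.outer(x, W))          # (n, m) hidden activations
    return H @ a / np.sqrt(m), H


def train(alpha, lr=0.3, steps=800):
    """Gradient descent on L = mean((alpha*(f - f_init) - y)^2) / (2 alpha^2)."""
    W, a = init_params()
    W0, a0 = W.copy(), a.copy()
    f_init, _ = forward(W, a, x)         # centering so alpha*f(w0) stays bounded
    theta0 = np.sqrt(np.sum(W0**2) + np.sum(a0**2))
    losses = []
    for _ in range(steps):
        f, H = forward(W, a, x)
        r = alpha * (f - f_init) - y     # residual of the scaled, centered model
        losses.append(0.5 * np.mean(r**2) / alpha**2)
        grad_f = r / (n * alpha)                     # dL/df_i
        grad_a = H.T @ grad_f / np.sqrt(m)
        S = (1.0 - H**2) * (a / np.sqrt(m))          # tanh' times output weights
        grad_W = S.T @ (grad_f * x)                  # chain rule through tanh(W x)
        W -= lr * grad_W
        a -= lr * grad_a
    move = np.sqrt(np.sum((W - W0)**2) + np.sum((a - a0)**2))
    return losses, move / theta0                     # relative parameter movement


losses_1, move_1 = train(alpha=1.0)      # ordinary training
losses_big, move_big = train(alpha=100.0)  # lazy regime
print(f"rel. movement: alpha=1 -> {move_1:.3f}, alpha=100 -> {move_big:.5f}")
```

Under this rescaling the function-space dynamics are identical for every alpha, so the loss decreases at the same rate in both runs, while the relative parameter movement shrinks roughly like 1/alpha. This is how scaling alone, with no change in width, produces the lazy regime.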