On the Power and Limitations of Random Features for Understanding Neural Networks

Dec-25-2025, 09:51:00 GMT–Neural Information Processing Systems

Recently, a spate of papers have provided positive theoretical results for training over-parameterized neural networks (where the network size is larger than what is needed to achieve low error). The key insight is that with sufficient over-parameterization, gradient-based methods will implicitly leave some components of the network relatively unchanged, so the optimization dynamics will behave as if those components are essentially fixed at their initial random values. In fact, fixing these \emph{explicitly} leads to the well-known approach of learning with random features (e.g.

neural network, power and limitation, random feature, (6 more...)

Neural Information Processing Systems

Dec-25-2025, 09:51:00 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.50)