Reviews: The Convergence Rate of Neural Networks for Learned Functions of Different Frequencies
–Neural Information Processing Systems
Which functions neural networks (NNs) learn to approximate, and how fast, are central questions in the study of the training dynamics of (deep) NNs. A common belief behind this problem is that training a network longer than necessary may cause the model to overfit. However, the definition of overfitting varies from paper to paper. Moreover, overfitting is closely linked to another active topic in the area: over-parametrization. See "Advani & Saxe 2017 High Dimensional Dynamics of Gen Error for NNs" for a modern take on this link. With this link in mind, we focus on fixed-size networks.
Jan-23-2025, 23:49:54 GMT