Initialization-Dependent Sample Complexity of Linear Predictors and Neural Networks
Neural Information Processing Systems
Clearly, in order for learning to be possible, we must impose some constraints on the size of the function class. One possibility is to bound the number of parameters (i.e., the dimensions of the matrix W), in which case learnability follows from standard VC-dimension or covering number arguments (see Anthony and Bartlett [1999]).
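As a rough illustration of the parameter-counting route, the standard uniform convergence bound can be stated as follows. This is a sketch under assumptions not taken from the text above: a binary-valued hypothesis class \mathcal{H} with finite VC dimension D, m i.i.d. samples, and a universal constant C. The symbols L, \hat{L}_m, \epsilon, and \delta are introduced here for illustration only.

\[
\Pr\left[\, \sup_{h \in \mathcal{H}} \bigl| L(h) - \hat{L}_m(h) \bigr| \le C \sqrt{\frac{D + \log(1/\delta)}{m}} \,\right] \ge 1 - \delta ,
\qquad\text{so}\qquad
m = O\!\left( \frac{D + \log(1/\delta)}{\epsilon^{2}} \right)
\]

samples suffice for excess error at most \epsilon. For example, homogeneous halfspaces in \mathbb{R}^d have VC dimension d, so a bound on the dimension directly yields a bound on the sample complexity. Real-valued classes require the analogous pseudo-dimension or covering number versions of this statement.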
Oct-8-2025, 04:59:24 GMT