Locality defeats the curse of dimensionality in convolutional teacher-student scenarios

Oct-10-2024, 07:58:26 GMT–Neural Information Processing Systems

Convolutional neural networks perform a local and translationally invariant treatment of the data: quantifying which of these two aspects is central to their success remains a challenge. We study this problem within a teacher-student framework for kernel regression, using 'convolutional' kernels inspired by the neural tangent kernel of simple convolutional architectures of given filter size. Using heuristic methods from physics, we find in the ridgeless case that locality is key in determining the learning curve exponent \beta (that relates the test error \epsilon_t \sim P^{-\beta} to the size of the training set P), whereas translational invariance is not. In particular, if the filter size of the teacher t is smaller than that of the student s, \beta is a function of s only and does not depend on the input dimension. We confirm our predictions on \beta empirically.
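As an illustration of how a learning curve exponent such as \beta is commonly estimated from measurements, the sketch below fits a straight line to test error versus training set size in log-log space. It is a minimal example under stated assumptions, not the authors' procedure: the function name fit_learning_curve_exponent and the synthetic data are hypothetical, and the fit assumes the errors already follow a clean power law \epsilon_t \sim P^{-\beta} over the whole range.

```python
import numpy as np


def fit_learning_curve_exponent(train_sizes, test_errors):
    """Estimate beta in eps_t ~ c * P^(-beta) via a least-squares line fit in log-log space.

    Hypothetical helper for illustration; returns (beta, c).
    """
    log_p = np.log(np.asarray(train_sizes, dtype=float))
    log_err = np.log(np.asarray(test_errors, dtype=float))
    # np.polyfit returns coefficients from highest degree down: [slope, intercept]
    slope, intercept = np.polyfit(log_p, log_err, deg=1)
    return -slope, np.exp(intercept)


# Synthetic, noise-free learning curve (assumed values, not results from the paper)
sizes = np.array([128.0, 256.0, 512.0, 1024.0, 2048.0])
errors = 3.0 * sizes ** -0.5

beta, prefactor = fit_learning_curve_exponent(sizes, errors)
print(f"estimated beta = {beta:.3f}, prefactor = {prefactor:.3f}")
```

With real measurements the fit would usually be restricted to the large-P range, since a power law of this kind describes the asymptotic decay and small training sets tend to deviate from it.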

convolutional teacher-student scenario, dimensionality, locality defeat, (3 more...)
