Eigenvalue Decay Implies Polynomial-Time Learnability for Neural Networks

Feb-14-2020, 09:55:46 GMT–Neural Information Processing Systems

We consider the problem of learning function classes computed by neural networks with various activations (e.g. ReLU or Sigmoid), a task believed to be computationally intractable in the worst-case. A major open problem is to understand the minimal assumptions under which these classes admit provably efficient algorithms. In this work we show that a natural distributional assumption corresponding to {\em eigenvalue decay} of the Gram matrix yields polynomial-time algorithms in the non-realizable setting for expressive classes of networks (e.g. We make no assumptions on the structure of the network or the labels.

algorithm, distributional assumption, polynomial-time learnability, (4 more...)

Neural Information Processing Systems

Feb-14-2020, 09:55:46 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.80)