Long-Tailed Learning Requires Feature Learning

Laurent, Thomas, von Brecht, James H., Bresson, Xavier

Dec-28-2022–arXiv.org Artificial Intelligence

Part of the motivation for deploying a neural network arises from the belief that algorithms that learn features/representations generalize better than algorithms that do not. We try to give some mathematical ballast to this notion by studying a data model where, at an intuitive level, a learner succeeds if and only if it manages to learn the correct features. The data model itself attempts to capture two key structures observed in natural data such as text or images. First, it is endowed with a latent structure at the patch or word level that is directly tied to a classification task. Second, the data distribution has a long-tail, in the sense that rare and uncommon instances collectively form a significant fraction of the data. We derive non-asymptotic generalization error bounds that quantify, within our framework, the penalty that one must pay for not learning features.

artificial intelligence, latexit sha1, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Dec-28-2022

arXiv.org PDF

Add feedback

Country:
- Asia > Singapore (0.04)

Genre:
- Research Report (0.81)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks (1.00)
  - Statistical Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found