Learning curves for the multi-class teacher-student perceptron

Cornacchia, Elisabetta, Mignacco, Francesca, Veiga, Rodrigo, Gerbelot, Cédric, Loureiro, Bruno, Zdeborová, Lenka

Mar-22-2022–arXiv.org Artificial Intelligence

One of the most classical results in high-dimensional learning theory provides a closed-form expression for the generalisation error of binary classification with the single-layer teacher-student perceptron on i.i.d. Gaussian inputs. Both Bayes-optimal estimation and empirical risk minimisation (ERM) have been extensively analysed in this setting. At the same time, a considerable part of modern machine learning practice concerns multi-class classification, yet an analogous analysis for the corresponding multi-class teacher-student perceptron has been missing. In this manuscript we fill this gap by deriving and evaluating asymptotic expressions for both the Bayes-optimal and ERM generalisation errors in the high-dimensional regime. For Gaussian teacher weights, we investigate the performance of ERM with both cross-entropy and square losses, and explore the role of ridge regularisation in approaching Bayes-optimality. In particular, we observe that regularised cross-entropy minimisation yields close-to-optimal accuracy. For a binary teacher, by contrast, we show that a first-order phase transition arises in the Bayes-optimal performance.
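
As a rough illustration of the setting described in the abstract, the following NumPy sketch simulates a multi-class teacher-student perceptron at finite size. Several details are assumptions not stated above: labels are taken as the argmax of the teacher's K linear outputs, weights are scaled by 1/sqrt(d), the student minimises a ridge-regularised square loss on one-hot labels (solved in closed form), and all function names and parameter values are hypothetical. It estimates the generalisation error by Monte Carlo on fresh samples and does not reproduce the paper's asymptotic formulas.

```python
import numpy as np


def sample_data(n, teacher, rng):
    """Draw n i.i.d. standard Gaussian inputs and label them with the teacher.

    Assumption: the label is the argmax of the K teacher outputs W* x / sqrt(d).
    """
    d = teacher.shape[1]
    X = rng.standard_normal((n, d))
    labels = np.argmax(X @ teacher.T / np.sqrt(d), axis=1)
    return X, labels


def fit_ridge_square_loss(X, labels, num_classes, lam):
    """Minimise sum_i ||y_i - W x_i / sqrt(d)||^2 + lam * ||W||_F^2 in closed form."""
    d = X.shape[1]
    Y = np.eye(num_classes)[labels]            # one-hot targets, shape (n, K)
    gram = X.T @ X / d + lam * np.eye(d)       # regularised Gram matrix, shape (d, d)
    cross = X.T @ Y / np.sqrt(d)               # input-target correlations, shape (d, K)
    return np.linalg.solve(gram, cross).T      # student weights, shape (K, d)


def generalisation_error(student, teacher, n_test, rng):
    """Monte Carlo estimate of the misclassification rate on fresh samples."""
    X, labels = sample_data(n_test, teacher, rng)
    predictions = np.argmax(X @ student.T, axis=1)
    return float(np.mean(predictions != labels))


rng = np.random.default_rng(0)
d, num_classes, lam = 50, 3, 0.1               # hypothetical illustration values
teacher = rng.standard_normal((num_classes, d))  # Gaussian teacher weights

errors = {}
for alpha in (0.5, 20.0):                      # alpha = n / d, samples per dimension
    n = int(alpha * d)
    X_train, y_train = sample_data(n, teacher, rng)
    student = fit_ridge_square_loss(X_train, y_train, num_classes, lam)
    errors[alpha] = generalisation_error(student, teacher, 5000, rng)
    print(f"alpha = {alpha:5.1f}  estimated generalisation error = {errors[alpha]:.3f}")
```

Sweeping `alpha` over a finer grid would trace an empirical learning curve. The cross-entropy loss discussed in the abstract has no closed-form minimiser and would need an iterative solver instead.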

artificial intelligence, equation, machine learning

Country:
- South America > Brazil
  - São Paulo (0.04)
- North America > United States
  - New York > New York County > New York City (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
    - Oxfordshire > Oxford (0.04)
  - Switzerland > Vaud
    - Lausanne (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.81)
