Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup

Sebastian Goldt, Madhu Advani, Andrew M. Saxe, Florent Krzakala, Lenka Zdeborová

Aug-20-2025, 02:11:13 GMT–Neural Information Processing Systems

Deep neural networks behind state-of-the-art results in image classification and other domains have one thing in common: their size.

generalisation error, neural network, student, (15 more...)

Neural Information Processing Systems

Aug-20-2025, 02:11:13 GMT

Conferences PDF

Country:
- North America
  - Canada (0.04)
  - United States > Massachusetts
    - Middlesex County > Cambridge (0.04)
- Europe
  - United Kingdom > England
    - Oxfordshire > Oxford (0.14)
    - Cambridgeshire > Cambridge (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)

Genre:
- Research Report (0.68)

Industry:
- Education (0.47)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning > Gradient Descent (1.00)
  - Neural Networks (1.00)

Duplicate Docs Excel Report

Title
Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup

Similar Docs Excel Report more

Title	Similarity	Source
None found