A spring-block theory of feature learning in deep neural networks

Jul-27-2024–arXiv.org Machine Learning

Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, 306 N Wright St, Urbana, IL 61801, USA (Dated: July 30, 2024) A central question in deep learning is how deep neural networks (DNNs) learn features. This collective effect of non-linearity, noise, learning rate, width, depth, and numerous other parameters, has eluded first-principles theories which are built from microscopic neuronal dynamics. Here we present a noise-non-linearity phase diagram that highlights where shallow or deep layers learn features more effectively. We then propose a macroscopic mechanical theory of feature learning that accurately reproduces this phase diagram, offering a clear intuition for why and how some DNNs are "lazy" and some are "active", and relating the distribution of feature learning over layers with test accuracy. Deep neural networks (DNNs) progressively compute propose a macroscopic theory of feature learning in deep, features from which the final layer generates predictions.

friction, neural network, noise, (15 more...)

arXiv.org Machine Learning

Jul-27-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Illinois > Champaign County > Urbana (0.24)
- Europe > Switzerland
  - Zürich > Zürich (0.14)
  - Basel-City > Basel (0.05)
- Asia > China
  - Anhui Province > Hefei (0.04)
- Africa > Middle East
  - Tunisia > Ben Arous Governorate > Ben Arous (0.04)

Genre:
- Research Report (0.40)

Industry:
- Energy (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found