Nonlinear random matrix theory for deep learning
Jeffrey Pennington, Pratik Worah
Neural Information Processing Systems
Neural network configurations with random weights play an important role in the analysis of deep learning. They define the initial loss landscape and are closely related to kernel and random feature methods. Despite the fact that these networks are built out of random matrices, the vast and powerful machinery of random matrix theory has so far found limited success in studying them. A main obstacle in this direction is that neural networks are nonlinear, which prevents the straightforward utilization of many of the existing mathematical results. In this work, we open the door for direct applications of random matrix theory to deep learning by demonstrating that the pointwise nonlinearities typically applied in neural networks can be incorporated into a standard method of proof in random matrix theory known as the moments method.
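To make the object of study concrete, below is a minimal sketch (not taken from the paper) of the kind of matrix the moments method is applied to: the Gram matrix of nonlinear random features Y = f(WX), where W and X are Gaussian random matrices and f is a pointwise nonlinearity. The dimensions, normalizations, and the choice of f = tanh here are illustrative assumptions.

```python
import numpy as np

# Illustrative sizes (assumed): input dim, feature dim, number of samples.
n0, n1, m = 500, 500, 1000
rng = np.random.default_rng(0)

X = rng.standard_normal((n0, m)) / np.sqrt(n0)   # random data matrix
W = rng.standard_normal((n1, n0)) / np.sqrt(n0)  # random weight matrix
f = np.tanh                                      # pointwise nonlinearity (assumed)

Y = f(W @ X)                # nonlinear random feature matrix
M = (Y.T @ Y) / n1          # Gram matrix; its eigenvalue spectrum is the object of study
eigvals = np.linalg.eigvalsh(M)

# Empirical spectral moments m_k = (1/dim) tr(M^k); the moments method
# characterizes the large-dimension limits of such quantities analytically.
for k in range(1, 5):
    print(f"m_{k} = {np.mean(eigvals**k):.4f}")
```

In this sketch the moments are estimated numerically from a single finite-size draw; the paper's contribution is to compute their limiting values analytically despite the nonlinearity f.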