Regularization Implies balancedness in the deep linear network

Nov-4-2025–arXiv.org Machine Learning

We use geometric invariant theory (GIT) to study the deep linear network (DLN). The Kempf-Ness theorem is used to establish that the $L^2$ regularizer is minimized on the balanced manifold. This allows us to decompose the training dynamics into two distinct gradient flows: a regularizing flow on fibers and a learning flow on the balanced manifold. We show that the regularizing flow is exactly solvable using the moment map. This approach provides a common mathematical framework for balancedness in deep learning and linear systems theory. We use this framework to interpret balancedness in terms of model reduction and Bayesian principles.

artificial intelligence, equation, machine learning, (13 more...)

arXiv.org Machine Learning

Nov-4-2025

arXiv.org PDF

Add feedback

Country:
- Europe
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
- North America > United States
  - Indiana (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
  - New Jersey > Mercer County
    - Princeton (0.04)
  - Rhode Island > Providence County
    - Providence (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found