Algorithmic Regularization in Learning Deep Homogeneous Models: Layers are Automatically Balanced

Nov-20-2025, 23:17:42 GMT–Neural Information Processing Systems

We study the implicit regularization imposed by gradient descent for learning multi-layer homogeneous functions including feed-forward fully connected and convolutional deep neural networks with linear, ReLU or Leaky ReLU activation. We rigorously prove that gradient flow (i.e.

algorithmic regularization, gradient descent, learning deep homogeneous model, (4 more...)

Neural Information Processing Systems

Nov-20-2025, 23:17:42 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)