Implicit Bias of Gradient Descent on Reparametrized Models: On Equivalence to Mirror Descent

Zhiyuan Li

Aug-19-2025, 12:33:22 GMT–Neural Information Processing Systems

As part of the effort to understand the implicit bias of gradient descent in overparametrized models, several results have shown how the training trajectory on the overparametrized model can be understood as mirror descent on a different objective. The main result here is a characterization of this phenomenon under a notion termed commuting parametrization, which encompasses all previous results in this setting.
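
The abstract does not name a specific parametrization, so the following is only an illustrative sketch under stated assumptions. It uses the well-known quadratic parametrization w = u * u (elementwise) with a toy least-squares loss, for which gradient flow on u is known to match mirror flow on w with the potential R(w) = (1/4) * sum(w log w - w). The function names, the loss, the data `a`, `y`, and the step size are all hypothetical choices made for this example. In discrete time the two updates agree only up to terms of order lr^2 per step, so the trajectories are close rather than identical.

```python
import math

def grad_loss(w, a, y):
    """Gradient of the toy loss L(w) = 0.5 * (a.w - y)^2 with respect to w."""
    residual = sum(ai * wi for ai, wi in zip(a, w)) - y
    return [residual * ai for ai in a]

def gd_on_u(u0, a, y, lr, steps):
    """Plain gradient descent on u, with model weights w = u * u (elementwise)."""
    u = list(u0)
    for _ in range(steps):
        g = grad_loss([ui * ui for ui in u], a, y)
        # Chain rule: dL/du_i = 2 * u_i * dL/dw_i
        u = [ui - lr * 2.0 * ui * gi for ui, gi in zip(u, g)]
    return [ui * ui for ui in u]

def mirror_descent_on_w(w0, a, y, lr, steps):
    """Mirror descent on w with the assumed potential R(w) = 1/4 * sum(w log w - w)."""
    w = list(w0)
    for _ in range(steps):
        g = grad_loss(w, a, y)
        # grad R(w) = log(w) / 4, so the dual-space step is
        # log w <- log w - 4 * lr * g, i.e. a multiplicative update on w.
        w = [wi * math.exp(-4.0 * lr * gi) for wi, gi in zip(w, g)]
    return w

# Underdetermined problem: one equation, two unknowns (hypothetical data).
a, y = [1.0, 2.0], 1.0
u0 = [0.5, 0.5]
w_gd = gd_on_u(u0, a, y, lr=1e-3, steps=5000)
w_md = mirror_descent_on_w([ui * ui for ui in u0], a, y, lr=1e-3, steps=5000)

# Largest coordinate-wise difference between the two end points.
gap = max(abs(p - q) for p, q in zip(w_gd, w_md))
print("GD on u, mapped to w:", w_gd)
print("Mirror descent on w: ", w_md)
print("max gap:", gap)
```

Both runs fit the single equation a.w = y, and among the many solutions they end up at nearly the same one, which is the sense in which the reparametrized gradient descent inherits the implicit bias of the mirror map. Shrinking `lr` (with `steps` scaled up accordingly) should shrink `gap`.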

artificial intelligence, init, machine learning

Country:
- Europe > Russia (0.04)
- North America > United States
  - New Jersey > Mercer County
    - Princeton (0.04)
  - Connecticut > New Haven County
    - New Haven (0.04)
- Asia
  - Russia (0.04)
  - Japan > Honshū
    - Kansai > Kyoto Prefecture > Kyoto (0.04)

Genre:
- Research Report (0.67)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.71)
