Implicit Bias of AdamW: $\ell_\infty$ Norm Constrained Optimization
