The late-stage training dynamics of (stochastic) subgradient descent on homogeneous neural networks
