Understanding Edge-of-Stability Training Dynamics with a Minimalist Example

Open in new window