Precise gradient descent training dynamics for finite-width multi-layer neural networks

Open in new window