–Neural Information Processing Systems
Hence, in this infinite-width limit, it suffices that the smallest eigenvalue of the NTK is bounded away from 0 for gradient descent to reach zero loss.
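A minimal numerical sketch of why this suffices, under assumptions not stated in the excerpt: squared loss, a kernel K that stays fixed during training, and a step size small enough that 1 - lr * lambda is nonnegative for every eigenvalue lambda of K. Under those assumptions the training residual r = f - y evolves as r <- r - lr * K r, so the loss shrinks by at least a factor (1 - lr * lambda_min)^2 per step. The 2x2 matrix `K`, the starting residual, and the helper names below are hypothetical and chosen only for illustration.

```python
def matvec(K, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(k_ij * v_j for k_ij, v_j in zip(row, v)) for row in K]


def kernel_gd_losses(K, residual, lr, steps):
    """Run linearized gradient descent on the residual r = f - y.

    Assumes squared loss 0.5 * ||r||^2 and a fixed kernel K, so each
    step is r <- r - lr * K r. Returns the loss before every step and
    after the last one.
    """
    losses = [0.5 * sum(r * r for r in residual)]
    for _ in range(steps):
        Kr = matvec(K, residual)
        residual = [r - lr * g for r, g in zip(residual, Kr)]
        losses.append(0.5 * sum(r * r for r in residual))
    return losses


# Hypothetical stand-in for an NTK Gram matrix; its eigenvalues are 1 and 3.
K = [[2.0, 1.0], [1.0, 2.0]]
lam_min = 1.0
lr = 0.1  # assumed small enough: 1 - lr * 3 is still positive

losses = kernel_gd_losses(K, [1.0, 0.5], lr, steps=200)
rate = (1.0 - lr * lam_min) ** 2  # per-step contraction bound on the loss

for t in (0, 1, 10, 100, 200):
    print(t, losses[t], rate ** t * losses[0])
```

Each printed loss sits at or below the geometric bound in the last column. If lambda_min were 0, the residual component along the corresponding eigenvector would never shrink, which is why a strictly positive lower bound on the smallest eigenvalue is the relevant condition.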
Feb-8-2026, 05:34:52 GMT