323746f0ae2fbd8b6f500dc2d5c5f898-Paper-Conference.pdf

Neural Information Processing Systems 

Hence, in this infinite-width limit, it suffices that the smallest eigenvalue of the NTK is bounded away from0for gradient descent to reach zero loss.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found