Stability & Generalisation of Gradient Descent for Shallow Neural Networks without the Neural Tangent Kernel

Neural Information Processing Systems 

This issue has attracted considerable attention in recent years, with numerous works [Du et al., 2018, Lee et al., 2019, Allen-Zhu et al., 2019, Oymak and Soltanolkotabi, 2020] demonstrating that overparameterised shallow networks (in a sense