Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? --- A Neural Tangent Kernel Perspective
–Neural Information Processing Systems
Deep residual networks (ResNets) have demonstrated better generalization performance than deep feedforward networks (FFNets). However, the theory behind such a phenomenon is still largely unknown.
Neural Information Processing Systems
Dec-23-2025, 20:01:42 GMT
- Technology: