Towards Understanding the Importance of Shortcut Connections in Residual Networks
–Neural Information Processing Systems
Residual Network (ResNet) is undoubtedly a milestone in deep learning. ResNet is equipped with shortcut connections between layers, and exhibits efficient training using simple first order algorithms. Despite of the great empirical success, the reason behind is far from being well understood. In this paper, we study a two-layer non-overlapping convolutional ResNet. Training such a network requires solving a non-convex optimization problem with a spurious local optimum.
Neural Information Processing Systems
May-27-2025, 12:22:01 GMT
- Technology: