Kaixuan Huang Yuqing Wang

Neural Information Processing Systems 

Deep residual networks (ResNets) have demonstrated better generalization performance than deep feedforward networks (FFNets). However, the theory behind such a phenomenon is still largely unknown.