Global Convergence of Gradient Descent for Deep Linear Residual Networks

Lei Wu, Qingcan Wang, Chao Ma

Neural Information Processing Systems 

It is motivated by avoiding stable manifolds of saddle points.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found