Global Convergence of Gradient Descent for Deep Linear Residual Networks