Optimal signal propagation in ResNets through residual scaling
