Exact solutions to the nonlinear dynamics of learning in deep linear neural networks