On the Learning Dynamics of Deep Neural Networks