Two-Time-Scale Learning Dynamics: A Population View of Neural Network Training