A Two-Phase Perspective on Deep Learning Dynamics