Towards Understanding Generalization via Decomposing Excess Risk Dynamics