What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
