A Scalable Measure of Loss Landscape Curvature for Analyzing the Training Dynamics of LLMs

Open in new window