Analyzing Sharpness along GD Trajectory: Progressive Sharpening and Edge of Stability

Neural Information Processing Systems 

Recent findings demonstrate that modern neural networks trained by full-batch gradient descent typically enter a regime called Edge of Stability (EOS).

Similar Docs  Excel Report  more

TitleSimilaritySource
None found