The Epochal Sawtooth Effect: Unveiling Training Loss Oscillations in Adam and Other Optimizers

Open in new window