Cautious Optimizers: Improving Training with One Line of Code

Open in new window