Generalization and Optimization of SGD with Lookahead
