Grad-GradaGrad? A Non-Monotone Adaptive Stochastic Gradient Method
