SGD with Adaptive Preconditioning: Unified Analysis and Momentum Acceleration

Open in new window