AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis

Open in new window