Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
