SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients
–Neural Information Processing Systems
Although multiple adaptive gradient methods were recently studied, they mainly focus on either empirical or theoretical aspects and also only work for specific problems by using some specific adaptive learning rates.
Feb-8-2026, 13:34:32 GMT