SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients

Neural Information Processing Systems 

Although multiple adaptive gradient methods were recently studied, they mainly focus on either empirical or theoretical aspects and also only work for specific problems by using some specific adaptive learning rates.
