Adam with model exponential moving average is effective for nonconvex optimization

Neural Information Processing Systems 

One of the most popular optimization algorithms is Adam [Kingma and Ba, 2014].

Similar Docs  Excel Report  more

TitleSimilaritySource
None found