Closing the gap between the upper bound and lower bound of Adam's iteration complexity

Neural Information Processing Systems 

Recently, Arjevani et al. [1] established a lower bound on the iteration complexity of first-order optimization under an L-smooth condition and a bounded noise variance assumption. However, a thorough review of the existing literature on Adam's convergence reveals a noticeable gap: none of the known upper bounds meet this lower bound. In this paper, we close the gap by deriving a new convergence guarantee for Adam, assuming only an L-smooth condition and bounded noise variance. Our results remain valid across a broad spectrum of hyperparameters. In particular, with properly chosen hyperparameters, we derive an upper bound on Adam's iteration complexity and show that it matches the lower bound for first-order optimizers.
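For reference, the algorithm analyzed is the standard Adam optimizer of Kingma & Ba. Below is a minimal NumPy sketch of the update; the hyperparameter values, toy objective, and noise model in the usage loop are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def adam_step(x, g, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update. Returns the new iterate and moment estimates."""
    m = beta1 * m + (1 - beta1) * g          # first-moment (momentum) estimate
    v = beta2 * v + (1 - beta2) * g * g      # second-moment estimate
    m_hat = m / (1 - beta1 ** t)             # bias correction for m
    v_hat = v / (1 - beta2 ** t)             # bias correction for v
    x = x - lr * m_hat / (np.sqrt(v_hat) + eps)
    return x, m, v

# Illustrative usage on a toy quadratic f(x) = ||x||^2 / 2 with Gaussian
# gradient noise, mimicking the bounded-variance stochastic setting.
rng = np.random.default_rng(0)
x = np.array([5.0, -3.0])
m = np.zeros_like(x)
v = np.zeros_like(x)
for t in range(1, 1001):
    g = x + rng.normal(scale=0.1, size=x.shape)  # noisy gradient of f
    x, m, v = adam_step(x, g, m, v, t)
print(x)  # should be near the minimizer at the origin
```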