Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond (Supplementary File)

Pan Zhou, Hanshu Yan

Aug-18-2025, 06:43:00 GMT–Neural Information Processing Systems

This supplementary file is structured as follows. […] Appendix D gives the proofs of the main results in Sec. 4, including […]. Finally, Appendix E provides the proofs of the results in Sec. 5, including Theorems 5 and 6, which analyze the optimization error, generalization error, and excess risk error of the […].

The main limitation of this work is that its analysis does not apply to general nonconvex problems. This is because, as explained in Sec. […]. But as shown in Sec. 3, to bound the excess risk error, one first needs to bound […]. For this reason, our analysis cannot be applied to general nonconvex problems.

Due to space limitations, we defer more experimental results and details to this appendix.

Country:
- Asia
  - Singapore (0.04)
  - China > Jiangsu Province
    - Nanjing (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
