Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond (Supplementary File) Pan Zhou Hanshu Y an

Neural Information Processing Systems 

It is structured as follows. Then Appendix D gives the proofs of the main results in Sec. 4, including Finally, Appendix E provides the proofs of the results in Sec. 5, including Theorems 5 and 6 which analyze the optimization error, generalization error and excess risk error of the The main limitation of this work is that the analysis in this work cannot be applicable to general nonconvex problems. This is because as explained in Sec. But as shown in Sec. 3, to bound the excess risk error, one needs to first bound In this way, our analysis cannot be applicable to general nonconvex problems. Due to space limitation, we defer more experimental results and details to this appendix.

Duplicate Docs Excel Report

Similar Docs  Excel Report  more

TitleSimilaritySource
None found