Results for non-convex: Some reviewers mention the lack of results for non-convex settings as a weakness of the
–Neural Information Processing Systems
We thank the reviewers for their comments. Paper structure: Reviewer 4 raised the concern that "the paper and the math seem unconnected at times". The proof for each has the same structure viz. Due to similarity the proofs are bundled in Appendix A. Due to similarity the proofs are bundled in Appendix B. Finally the proofs of the main theorems are bundled in Appendix C. We appreciate the positive feedback, and the pointer to confusing notation. Average iterate as opposed to last iterate is indeed done to produce a simple and generalizable analysis.
Neural Information Processing Systems
Oct-3-2025, 06:22:07 GMT
- Technology: