c8cc6e90ccbff44c9cee23611711cdc4-AuthorFeedback.pdf
–Neural Information Processing Systems
We thank the reviewers for their work and for the positive evaluation of our paper. Reviewer 3. 1. "the main theorems (Theorems 1 and 2) need a small step size, similar to previous works. In fact17 Safran and Shamir (2020) show that convergence is only possible for step sizeO(1/n)" Firstly, we disagree about18 Theorem 1--even with step size 1L it guarantees convergence to a neighborhood. Wewilladdtheseclarifications.24 2. "the dependence onµ has worsened. In particular, Nagaraj et al. (2019) give an error rate ofκ2/µ" This25 comparison isnot fair because Nagaraj etal.
Neural Information Processing Systems
Feb-10-2026, 08:14:33 GMT