Reviews: Continuous-time Models for Stochastic Optimization Algorithms

Neural Information Processing Systems 

I have read the rebuttal and I believe the authors have satisfactorily addressed my comments on prior work, so I have increased my rating. The SDE approximation method is well-established. Moreover, Minibatch SGD's continuous approximation has been considered by several prior works, e.g. Summary and review comments: The paper is well-written and one of its strengths in generally good comparison with prior work. The main theoretical results are: * SDE approximation for minibatch SGD and SVRG * Well-posedness of the SDEs * Matching convergence bounds using Lyapunov functions * Interpreting time-dependent adjustments as time-change and landscape-stretching.