Review for NeurIPS paper: Tight Nonparametric Convergence Rates for Stochastic Gradient Descent under the Noiseless Linear Model
–Neural Information Processing Systems
Weaknesses: While the paper is written very clearly, there are several questions I'd like to raise. Firstly, in discussing the applicability of the results the paper mentions'some basic vision or sound recognition tasks' (line 33) - I'd like to ask about some examples of such tasks. Looking at the statement of the Theorem 1, seems that it should be applicable in finite-dimensional spaces with invertible covariance matrices. If it is so, then I do not understand the results. In particular, for X distributed with a finite support and has identity covariance matrix, the conditions (a) and (b) hold for arbitrarily large positive \alpha, however the theorem statement implies that the estimates will go to zero at an arbitrarily large polynomial rate, which is not true.
Neural Information Processing Systems
Jan-22-2025, 06:22:48 GMT
- Technology: