Near-Optimal Streaming Heavy-Tailed Statistical Estimation with Clipped SGD
–Neural Information Processing Systems
We cast this problem as stochastic convex optimization with heavy tailed stochastic gradients, and prove that the widely used Clipped-SGD algorithm attains near-optimal sub-Gaussian statistical rates whenever the second moment of the stochastic gradient noise is finite.
Neural Information Processing Systems
Mar-18-2026, 04:53:31 GMT
- Technology: