Near-Optimal Streaming Heavy-Tailed Statistical Estimation with Clipped SGD
–Neural Information Processing Systems
We cast this problem as stochastic convex optimization with heavy tailed stochastic gradients, and prove that the widely used Clipped-SGD algorithm attains near-optimal sub-Gaussian statistical rates whenever the second moment of the stochastic gradient noise is finite.
Neural Information Processing Systems
Dec-23-2025, 23:52:58 GMT
- Technology: