Smoothness, Low Noise and Fast Rates
Srebro, Nathan, Sridharan, Karthik, Tewari, Ambuj
Neural Information Processing Systems
We establish an excess risk bound of O(H R_n^2 + sqrt{H L*} R_n) for ERM with an H-smooth loss function and a hypothesis class with Rademacher complexity R_n, where L* is the best risk achievable by the hypothesis class. For typical hypothesis classes where R_n = sqrt{R/n}, this translates to a learning rate of Õ(RH/n) in the separable (L* = 0) case and O(RH/n + sqrt{L* RH/n}) more generally. We also provide similar guarantees for online and stochastic convex optimization of a smooth non-negative objective.
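As a quick check of how the two stated rates follow from the general bound, the substitution can be written out directly. This is only a sketch using the abstract's own symbols (H, R_n, R, n, L*); it ignores the constants and logarithmic factors hidden by the O and Õ notation.

```latex
\begin{align*}
H R_n^2 + \sqrt{H L^*}\, R_n
  &= H \cdot \frac{R}{n} + \sqrt{H L^*} \cdot \sqrt{\frac{R}{n}}
     && \text{substituting } R_n = \sqrt{R/n} \\
  &= \frac{RH}{n} + \sqrt{\frac{L^* R H}{n}} .
\end{align*}
```

Setting L* = 0 removes the square-root term and leaves RH/n, the separable-case rate; for L* > 0 and large n the square-root term dominates, since it decays as 1/sqrt{n} rather than 1/n.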
Dec-31-2010