Online Non-Convex Learning: Following the Perturbed Leader is Optimal
Suggala, Arun Sai, Netrapalli, Praneeth
We study the problem of online learning with non-convex losses, where the learner has access to an offline optimization oracle. We show that the classical Follow the Perturbed Leader (FTPL) algorithm achieves optimal regret rate of $O(T^{-1/2})$ in this setting. This improves upon the previous best-known regret rate of $O(T^{-1/3})$ for FTPL. We further show that an optimistic variant of FTPL achieves better regret bounds when the sequence of losses encountered by the learner is `predictable'.
Mar-19-2019
- Country:
- Asia > India (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Genre:
- Research Report (0.64)
- Technology: