Online Learning with a Hint
Ofer Dekel, arthur flajolet, Nika Haghtalab, Patrick Jaillet
–Neural Information Processing Systems
We study a variant of online linear optimization where the player receives a hint about the loss function at the beginning of each round. The hint is given in the form of a vector that is weakly correlated with the loss vector on that round. We show that the player can benefit from such a hint if the set of feasible actions is sufficiently round. Specifically, if the set is strongly convex, the hint can be used to guarantee a regret of O(log(T)), and if the set is q-uniformly convex for q (2, 3), the hint can be used to guarantee a regret of o( T). In contrast, we establish Ω( T) lower bounds on regret when the set of feasible actions is a polyhedron.
Neural Information Processing Systems
Oct-2-2024, 22:40:15 GMT
- Country:
- North America > United States (0.93)
- Industry:
- Education > Educational Setting > Online (0.51)