Online Learning with a Hint

Dekel, Ofer, flajolet, arthur, Haghtalab, Nika, Jaillet, Patrick

Dec-31-2017–Neural Information Processing Systems

We study a variant of online linear optimization where the player receives a hint about the loss function at the beginning of each round. The hint is given in the form of a vector that is weakly correlated with the loss vector on that round. We show that the player can benefit from such a hint if the set of feasible actions is sufficiently round. Specifically, if the set is strongly convex, the hint can be used to guarantee a regret of O(log(T)), and if the set is q-uniformly convex for q\in(2,3), the hint can be used to guarantee a regret of o(sqrt{T}). In contrast, we establish Omega(sqrt{T}) lower bounds on regret when the set of feasible actions is a polyhedron.

algorithm, computer based training, educational technology, (22 more...)

Neural Information Processing Systems

Dec-31-2017

Conferences PDF

Add feedback

Country:
- North America > United States > Massachusetts (0.14)

Industry:
- Education > Educational Setting > Online (0.41)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning (0.97)
    - Representation & Reasoning > Optimization (0.46)
  - Enterprise Applications > Human Resources
    - Learning Management (0.41)

Duplicate Docs Excel Report

Title
Online Learning with a Hint
Online Learning with a Hint

Similar Docs Excel Report more

Title	Similarity	Source
None found