The Price of Bandit Information for Online Optimization

Dani, Varsha, Kakade, Sham M., Hayes, Thomas P.

Dec-31-2008–Neural Information Processing Systems

We present sharp rates of convergence (with respect to additive regret) for both the full information setting (where the cost function is revealed at the end of each round) and the bandit setting (where only the scalar cost incurred is revealed). In particular, this paper is concerned with the price of bandit information, by which we mean the ratio of the best achievable regret in the bandit setting to that in the full-information setting.

algorithm, full information case, information case, (11 more...)

Neural Information Processing Systems

Dec-31-2008

Conferences PDF

Add feedback

Country:
- North America > United States > Illinois > Cook County > Chicago (0.05)

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)

Duplicate Docs Excel Report

Title
The Price of Bandit Information for Online Optimization
The Price of Bandit Information for Online Optimization

Similar Docs Excel Report more

Title	Similarity	Source
None found