Bounded Regret for Finite-Armed Structured Bandits

Dec-31-2014–Neural Information Processing Systems

We study a new type of K-armed bandit problem where the expected return of one arm may depend on the returns of other arms. We present a new algorithm for this general class of problems and show that under certain circumstances it is possible to achieve finite expected cumulative regret. We also give problem-dependent lower bounds on the cumulative regret showing that at least in special cases the new algorithm is nearly optimal.

artificial intelligence, big data, finite regret, (20 more...)

Neural Information Processing Systems

Dec-31-2014

Conferences PDF

Add feedback

Country:
- North America > Canada > Alberta (0.14)

Genre:
- Research Report (0.46)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Data Science > Data Mining
    - Big Data (0.51)

Duplicate Docs Excel Report

Title
Bounded Regret for Finite-Armed Structured Bandits
Bounded Regret for Finite-Armed Structured Bandits

Similar Docs Excel Report more

Title	Similarity	Source
None found