Cornering Stationary and Restless Mixing Bandits with Remix-UCB

Mar-12-2024, 23:13:51 GMT–Neural Information Processing Systems

We study the restless bandit problem where arms are associated with stationary ϕ-mixing processes and where rewards are therefore dependent. The question that arises in this setting is how to carefully recover some independence by 'ignoring' the values of some rewards.
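The idea of 'ignoring' some rewards can be illustrated with a small sketch. This is not the Remix-UCB algorithm: it only shows the generic principle that, for a mixing process, observations separated by a gap are less dependent than adjacent ones. The function names `thin_rewards` and `thinned_mean`, the fixed `gap` parameter, and the sample values are assumptions made for illustration and do not come from the paper.

```python
from statistics import mean


def thin_rewards(rewards, gap):
    """Keep one reward, skip the next `gap` rewards, and repeat.

    Hypothetical helper: a fixed gap is an assumption of this sketch,
    not the schedule used by Remix-UCB.
    """
    if gap < 0:
        raise ValueError("gap must be non-negative")
    # Slicing with step gap + 1 keeps indices 0, gap + 1, 2 * (gap + 1), ...
    return rewards[::gap + 1]


def thinned_mean(rewards, gap):
    """Empirical mean computed only from the rewards that were kept."""
    kept = thin_rewards(rewards, gap)
    return mean(kept) if kept else 0.0


# Made-up reward sequence from a single arm, with visible local dependence.
rewards = [0.9, 0.8, 0.85, 0.2, 0.25, 0.3, 0.7, 0.75, 0.8]
kept = thin_rewards(rewards, gap=2)  # keeps the rewards at indices 0, 3, 6
estimate = thinned_mean(rewards, gap=2)
```

A larger gap discards more data but leaves the kept rewards further apart in time, so the trade-off is between sample size and dependence.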

algorithm, improved-ucb, remix-ucb, (14 more...)

Country:
- Europe > France
  - Île-de-France > Val-de-Marne
    - Cachan (0.04)
  - Provence-Alpes-Côte d'Azur > Bouches-du-Rhône
    - Marseille (0.04)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.47)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.69)
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning (0.68)