Multi-armed Bandits: Competing with Optimal Sequences

Mar-23-2026, 08:16:30 GMT–Neural Information Processing Systems

We consider sequential decision making problem in the adversarial setting, where regret is measured with respect to the optimal sequence of actions and the feedback adheres the bandit setting. It is well-known that obtaining sublinear regret in this setting is impossible in general, which arises the question of when can we do better than linear regret? Previous works show that when the environment is guaranteed to vary slowly and furthermore we are given prior knowledge regarding its variation (i.e., a limit on the amount of changes suffered by the environment), then this task is feasible. The caveat however is that such prior knowledge is not likely to be available in practice, which causes the obtained regret bounds to be somewhat irrelevant. Our main result is a regret guarantee that scales with the variation parameter of the environment, without requiring any prior knowledge about it whatsoever. By that, we also resolve an open problem posted by Gur, Zeevi and Besbes [8]. An important key component in our result is a statistical test for identifying non-stationarity in a sequence of independent random variables. This test either identifies nonstationarity or upper-bounds the absolute deviation of the corresponding sequence of mean values in terms of its total variation. This test is interesting on its own right and has the potential to be found useful in additional settings.

artificial intelligence, data mining, machine learning, (20 more...)

Neural Information Processing Systems

Mar-23-2026, 08:16:30 GMT

Conferences PDF

Add feedback

Genre:
- Research Report (0.34)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Data Science > Data Mining
    - Big Data (0.83)

Duplicate Docs Excel Report

Title
Multi-armed Bandits: Competing with Optimal Sequences Yahoo! Research Berkeley, CA
Multi-armed Bandits: Competing with Optimal Sequences

Similar Docs Excel Report more

Title	Similarity	Source
None found