Efficient Swap Regret Minimization in Combinatorial Bandits

Kontogiannis, Andreas, Pollatos, Vasilis, Mertikopoulos, Panayotis, Panageas, Ioannis

Feb-3-2026, arXiv.org Machine Learning

This paper addresses the problem of designing efficient no-swap-regret algorithms for combinatorial bandits, where the number of actions $N$ is exponentially large in the dimensionality of the problem. In this setting, an efficient no-swap-regret algorithm is one whose swap regret is sublinear in the horizon $T$ and depends only polylogarithmically on $N$. In contrast to external regret minimization, a weaker notion that is fairly well understood in the literature, achieving no-swap regret with a polylogarithmic dependence on $N$ has remained elusive in combinatorial bandits. Our paper resolves this challenge by introducing a no-swap-regret learning algorithm whose regret scales polylogarithmically in $N$ and is tight for the class of combinatorial bandits. To ground our results, we also demonstrate how to implement the proposed algorithm efficiently (that is, with a per-iteration complexity that also scales polylogarithmically in $N$) across a wide range of well-studied applications.
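To make the distinction between external and swap regret concrete, the following minimal sketch computes both for a tiny explicit action set, using the standard textbook definitions. It is not the paper's algorithm: the function names and the loss-matrix layout are assumptions made for illustration, and the code enumerates all $N$ actions directly, which is exactly what becomes infeasible when $N$ is exponentially large.

```python
def external_regret(actions, losses):
    """Regret against the best single fixed action in hindsight.

    actions[t] is the index of the action played at round t;
    losses[t][a] is the loss action a would have incurred at round t
    (full loss vectors are assumed known here, for illustration only).
    """
    n = len(losses[0])
    incurred = sum(losses[t][a] for t, a in enumerate(actions))
    best_fixed = min(sum(row[b] for row in losses) for b in range(n))
    return incurred - best_fixed


def swap_regret(actions, losses):
    """Regret against the best swap function a -> b applied in hindsight.

    For each action a, look only at the rounds where a was played and ask
    which single replacement b would have done best on those rounds.
    """
    n = len(losses[0])
    total = 0
    for a in range(n):
        rounds = [t for t, played in enumerate(actions) if played == a]
        incurred = sum(losses[t][a] for t in rounds)
        best_swap = min(sum(losses[t][b] for t in rounds) for b in range(n))
        total += incurred - best_swap
    return total


# Two actions, two rounds: the learner picks the worse action each time.
played = [0, 1]
loss_table = [[1, 0],
              [0, 1]]
print(external_regret(played, loss_table))
print(swap_regret(played, loss_table))
```

In this toy instance the swap regret exceeds the external regret, because the swap comparator may remap each played action separately. Since the identity-like constant maps are themselves swap functions, swap regret is never smaller than external regret, which is why it is the stronger notion.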

algorithm, artificial intelligence, machine learning

Country:
- North America > United States
  - California > Orange County > Irvine (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.14)
  - Greece > Attica
    - Athens (0.04)
  - France > Auvergne-Rhône-Alpes
    - Isère > Grenoble (0.04)
    - Lyon > Lyon (0.04)
- Asia > Japan
  - Honshū > Chūbu > Nagano Prefecture > Nagano (0.04)

Genre:
- Research Report (0.70)

Industry:
- Leisure & Entertainment (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning (1.00)
