Efficient Swap Regret Minimization in Combinatorial Bandits
Kontogiannis, Andreas, Pollatos, Vasilis, Mertikopoulos, Panayotis, Panageas, Ioannis
This paper addresses the problem of designing efficient no-swap regret algorithms for combinatorial bandits, where the number of actions $N$ is exponentially large in the dimensionality of the problem. In this setting, designing efficient no-swap regret translates to sublinear -- in horizon $T$ -- swap regret with polylogarithmic dependence on $N$. In contrast to the weaker notion of external regret minimization - a problem which is fairly well understood in the literature - achieving no-swap regret with a polylogarithmic dependence on $N$ has remained elusive in combinatorial bandits. Our paper resolves this challenge, by introducing a no-swap-regret learning algorithm with regret that scales polylogarithmically in $N$ and is tight for the class of combinatorial bandits. To ground our results, we also demonstrate how to implement the proposed algorithm efficiently -- that is, with a per-iteration complexity that also scales polylogarithmically in $N$ -- across a wide range of well-studied applications.
Feb-3-2026
- Country:
- Asia > Japan
- Honshū > Chūbu > Nagano Prefecture > Nagano (0.04)
- Europe
- France > Auvergne-Rhône-Alpes
- Greece > Attica
- Athens (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.14)
- North America > United States
- California > Orange County > Irvine (0.04)
- Asia > Japan
- Genre:
- Research Report (0.70)
- Industry:
- Leisure & Entertainment (0.46)
- Technology: