The Sliding Regret in Stochastic Bandits: Discriminating Index and Randomized Policies

Open in new window