Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality

Neural Information Processing Systems

In this paper, we study regret minimization in stochastic structured bandits.