An Asymptotically Optimal Batched Algorithm for the Dueling Bandit Problem

Open in new window