Regret Lower Bound and Optimal Algorithm in Dueling Bandit Problem

Open in new window