Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits
