On Weak Regret Analysis for Dueling Bandits

Neural Information Processing Systems 

When the optimality gap is negligible, we propose another algorithm that outperforms our first algorithm, highlighting the subtlety of this dueling bandit problem.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found