Choice Bandits

Neural Information Processing Systems 

There has been much interest in recent years in the problem of dueling bandits, where on each round the learner plays a pair of arms and receives as feedback the outcome of a relative pairwise comparison between them.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found