Multi-armed Bandits: Competing with Optimal Sequences

Zohar S. Karnin, Oren Anava

Neural Information Processing Systems 

It is well-known that obtaining sublinear regret in this setting is impossible in general, which arises the question of when can we do better than linear regret?

Similar Docs  Excel Report  more

TitleSimilaritySource
None found