On Regret with Multiple Best Arms

Neural Information Processing Systems 

We study a regret minimization problem with the existence of multiple best/near-optimal arms in the multi-armed bandit setting.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found