Mean-based Best Arm Identification in Stochastic Bandits under Reward Contamination

Neural Information Processing Systems 

This paper investigates the problem of best arm identification in contaminated stochastic multi-armed bandits.
