Threshold Bandits, With and Without Censored Feedback

Jacob D. Abernethy, Kareem Amin, Ruihao Zhu

Neural Information Processing Systems 

We consider the Threshold Bandit setting, a variant of the classical multi-armed bandit problem in which the reward on each round depends on a piece of side information known as a threshold value .

Similar Docs  Excel Report  more

TitleSimilaritySource
None found