Minimax Regret for Cascading Bandits

Neural Information Processing Systems 

Cascading bandits is a natural and popular model that frames the task of learning to rank from Bernoulli click feedback in a bandit setting.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found