Minimax Regret for Cascading Bandits
Neural Information Processing Systems
Cascading bandits is a natural and popular model that frames the task of learning to rank from Bernoulli click feedback in a bandit setting.
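To make the feedback model concrete, here is a minimal sketch of one round of cascade click feedback, assuming the standard formulation: the user scans the ranked list from the top, each item is attractive independently with its own Bernoulli probability, and the user clicks the first attractive item and stops. The function name `cascade_feedback`, the item labels, and the probabilities are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import random


def cascade_feedback(ranked_items, attraction_probs, rng):
    """Simulate one round of cascade click feedback (illustrative sketch).

    Returns the position of the first clicked item, or None if the user
    scans the whole list without clicking.
    """
    for position, item in enumerate(ranked_items):
        # Each item is attractive with its own Bernoulli probability.
        if rng.random() < attraction_probs[item]:
            return position  # the user clicks here and stops scanning
    return None


# Hypothetical attraction probabilities, for illustration only.
attraction_probs = {"a": 0.0, "b": 1.0, "c": 0.5}
rng = random.Random(0)
click = cascade_feedback(["a", "b", "c"], attraction_probs, rng)
```

Under this model the learner observes only the click position: items above the click were examined and skipped, while items below it were never examined, so they yield no feedback in that round.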
Feb-11-2026, 15:13:11 GMT
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- Illinois (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Technology: