Non
–Neural Information Processing Systems
The stochastic multi-armed bandit setting has been recently studied in the nonstationary regime, where the mean payoff of each action is a non-decreasing function of the number of rounds passed since it was last played. This model captures natural behavioral aspects of the users which crucially determine the performance of recommendation platforms, ad placement systems, and more.
Neural Information Processing Systems
Feb-10-2026, 06:23:47 GMT
- Country:
- Europe > France
- Île-de-France > Paris > Paris (0.04)
- North America > United States
- California (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Europe > France
- Technology: