Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption
–Neural Information Processing Systems
We study the infinite-horizon restless bandit problem with the average reward criterion, in both discrete-time and continuous-time settings.
Neural Information Processing Systems
Feb-9-2026, 08:47:04 GMT
- Country:
- Europe
- Netherlands > North Holland
- Amsterdam (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Netherlands > North Holland
- North America > United States
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- Pennsylvania > Allegheny County
- Europe
- Technology: