Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption

Neural Information Processing Systems 

We study the infinite-horizon restless bandit problem with the average reward criterion, in both discrete-time and continuous-time settings.
