Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems

Young Hun Jung, Ambuj Tewari

Neural Information Processing Systems 

Restless bandit problems are instances of non-stationary multi-armed bandits.
