Multi-Armed Bandits with Bounded Arm-Memory: Near-Optimal Guarantees for Best-Arm Identification and Regret Minimization

Neural Information Processing Systems 

We study the Stochastic Multi-armed Bandit problem under bounded arm-memory.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found