Understanding Memory-Regret Trade-Off for Streaming Stochastic Multi-Armed Bandits

He, Yuchen, Ye, Zichun, Zhang, Chihao

arXiv.org Machine Learning 

We study the stochastic multi-armed bandit problem in the $P$-pass streaming model. In this problem, the $n$ arms are present in a stream and at most $m

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found