Understanding Memory-Regret Trade-Off for Streaming Stochastic Multi-Armed Bandits
Yuchen He, Zichun Ye, Chihao Zhang
We study the stochastic multi-armed bandit problem in the $P$-pass streaming model. In this problem, the $n$ arms are present in a stream and at most $m
Jul-6-2024
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia > China
- Genre:
- Research Report > New Finding (0.66)
- Industry:
- Education (0.46)