Understanding Memory-Regret Trade-Off for Streaming Stochastic Multi-Armed Bandits

Open in new window