Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory
