On the Benefits of Free Exploration for Regret Minimization in Multi-Armed Bandits
