Optimal Regret Bounds for Collaborative Learning in Bandits

Open in new window