On Regret-Optimal Learning in Decentralized Multi-player Multi-armed Bandits
