Fair Algorithms for Multi-Agent Multi-Armed Bandits

Neural Information Processing Systems 

Instead, we seek to learn a fair distribution over the arms. Drawing on a long line of research in economics and computer science, we use the Nash social welfare as our notion of fairness.