Fair Algorithms for Multi-Agent Multi-Armed Bandits

Hossain, Safwan, Micha, Evi, Shah, Nisarg

Jul-13-2020–arXiv.org Artificial Intelligence

We propose a multi-agent variant of the classical multi-armed bandit problem, in which there are N agents and K arms, and pulling an arm generates a (possibly different) stochastic reward to each agent. Unlike the classical multi-armed bandit problem, the goal is not to learn the "best arm", as each agent may perceive a different arm as best for her. Instead, we seek to learn a fair distribution over arms. Drawing on a long line of research in economics and computer science, we use the Nash social welfare as our notion of fairness. We design multi-agent variants of three classic multi-armed bandit algorithms, and show that they achieve sublinear regret, now measured in terms of the Nash social welfare.

artificial intelligence, data mining, nsw, (16 more...)

arXiv.org Artificial Intelligence

Jul-13-2020

arXiv.org PDF

Add feedback

Country:
- North America > Canada
  - Ontario > Toronto (0.28)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Africa > South Sudan
  - Equatoria > Central Equatoria > Juba (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (1.00)
  - Artificial Intelligence > Representation & Reasoning
    - Agents (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found