Distributed Multi-Player Bandits - a Game of Thrones Approach

Nov-20-2025, 19:49:05 GMT–Neural Information Processing Systems

We consider a multi-armed bandit game where N players compete for K arms for T turns. Each player has different expected rewards for the arms, and the instantaneous rewards are independent and identically distributed. Performance is measured using the expected sum of regrets, compared to the optimal assignment of arms to players. We assume that each player knows only her own actions and the reward she received on each turn. Players cannot observe the actions of other players, and no communication between players is possible.
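
As a rough illustration of the performance measure described above, the following Python sketch computes the optimal assignment of arms to players by brute force and then the expected sum of regrets for a given sequence of joint actions. The names `mu`, `optimal_assignment`, and `expected_regret` are hypothetical and not taken from the paper. The sketch also assumes that N is at most K and that players who choose the same arm on a turn receive zero reward; the abstract states neither assumption.

```python
from itertools import permutations


def optimal_assignment(mu):
    """Return (best total expected reward, arm per player).

    mu[n][k] is the expected reward of player n on arm k.
    Assumes N <= K so every player can hold a distinct arm.
    """
    n_players, n_arms = len(mu), len(mu[0])
    best_value, best_arms = float("-inf"), None
    # Brute force over all one-to-one player-to-arm maps (small N, K only).
    for arms in permutations(range(n_arms), n_players):
        value = sum(mu[n][a] for n, a in enumerate(arms))
        if value > best_value:
            best_value, best_arms = value, arms
    return best_value, best_arms


def expected_regret(mu, actions):
    """Expected sum of regrets over the turns listed in `actions`.

    actions[t][n] is the arm chosen by player n on turn t.
    """
    best_value, _ = optimal_assignment(mu)
    regret = 0.0
    for joint in actions:
        # Assumption (not in the abstract): colliding players earn nothing.
        earned = sum(
            mu[n][a] for n, a in enumerate(joint) if joint.count(a) == 1
        )
        regret += best_value - earned
    return regret


# Two players, three arms; the expected rewards are illustrative values.
mu = [[0.9, 0.1, 0.5],
      [0.8, 0.7, 0.2]]
best_value, best_arms = optimal_assignment(mu)
# Turn 1: both players pick arm 0 (collision). Turn 2: the optimal assignment.
total_regret = expected_regret(mu, [(0, 0), (0, 1)])
```

Brute force is used here only to keep the sketch self-contained; for larger N and K a polynomial-time assignment solver, such as the Hungarian algorithm, would be the usual choice.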
