Clustered Multi-Agent Linear Bandits

Cherkaoui, Hamza, Barlier, Merwan, Colin, Igor

Oct-30-2023–arXiv.org Machine Learning

We address in this paper a particular instance of the multi-agent linear stochastic bandit problem, called clustered multi-agent linear bandits. In this setting, we propose a novel algorithm leveraging an efficient collaboration between the agents in order to accelerate the overall optimization problem. In this contribution, a network controller is responsible for estimating the underlying cluster structure of the network and optimizing the experiences sharing among agents within the same groups. We provide a theoretical analysis for both the regret minimization problem and the clustering quality. Through empirical evaluation against state-of-the-art algorithms on both synthetic and real data, we demonstrate the effectiveness of our approach: our algorithm significantly improves regret minimization while managing to recover the true underlying cluster partitioning.

agent, algorithm, artificial intelligence, (16 more...)

arXiv.org Machine Learning

Oct-30-2023

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
- North America > United States
  - California (0.04)
  - New York > New York County
    - New York City (0.05)
  - Georgia > Fulton County
    - Atlanta (0.04)
  - Florida > Broward County
    - Fort Lauderdale (0.04)
- Europe
  - France (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Spain > Andalusia
    - Granada Province > Granada (0.04)
  - Finland > Uusimaa
    - Helsinki (0.04)
- Asia
  - Singapore (0.04)
  - Macao (0.04)
  - Middle East > Israel
    - Haifa District > Haifa (0.04)
  - China > Beijing
    - Beijing (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found