Clustered Multi-Agent Linear Bandits
Cherkaoui, Hamza, Barlier, Merwan, Colin, Igor
We address in this paper a particular instance of the multi-agent linear stochastic bandit problem, called clustered multi-agent linear bandits. In this setting, we propose a novel algorithm leveraging an efficient collaboration between the agents in order to accelerate the overall optimization problem. In this contribution, a network controller is responsible for estimating the underlying cluster structure of the network and optimizing the experiences sharing among agents within the same groups. We provide a theoretical analysis for both the regret minimization problem and the clustering quality. Through empirical evaluation against state-of-the-art algorithms on both synthetic and real data, we demonstrate the effectiveness of our approach: our algorithm significantly improves regret minimization while managing to recover the true underlying cluster partitioning.
Oct-30-2023
- Country:
- Asia
- China > Beijing
- Beijing (0.04)
- Macao (0.04)
- Middle East > Israel
- Haifa District > Haifa (0.04)
- Singapore (0.04)
- China > Beijing
- Europe
- Finland > Uusimaa
- Helsinki (0.04)
- France (0.04)
- Spain > Andalusia
- Granada Province > Granada (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Finland > Uusimaa
- North America > United States
- California (0.04)
- Florida > Broward County
- Fort Lauderdale (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- New York > New York County
- New York City (0.05)
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Asia
- Genre:
- Research Report (0.50)
- Technology: