NeurIPS2021_ImperfectCommmunicationBandits
–Neural Information Processing Systems
The cooperative bandit problem is increasingly becoming relevant due to its applications in large-scale decision-making. However, most research for this problem focuses exclusively on the setting with perfect communication, whereas in most real-world distributed settings, communication is often over stochastic networks, with arbitrary corruptions and delays. In this paper, we study cooperative bandit learning under three typical real-world communication scenarios, namely, (a) message-passing over stochastic time-varying networks, (b) instantaneous rewardsharing over a network with random delays, and (c) message-passing with adversarially corrupted rewards, including byzantine communication. For each of these environments, we propose decentralized algorithms that achieve competitive performance, along with near-optimal guarantees on the incurred group regret as well. Furthermore, in the setting with perfect communication, we present an improved delayed-update algorithm that outperforms the existing state-of-the-art on various network topologies.
Neural Information Processing Systems
Apr-25-2026, 14:38:03 GMT
- Genre:
- Research Report (0.46)
- Technology:
- Information Technology
- Communications > Networks (0.66)
- Data Science > Data Mining
- Big Data (0.69)
- Artificial Intelligence
- Representation & Reasoning > Agents (1.00)
- Machine Learning (1.00)
- Information Technology