NeurIPS2021_ImperfectCommmunicationBandits

Apr-25-2026, 14:38:03 GMT–Neural Information Processing Systems

The cooperative bandit problem is increasingly becoming relevant due to its applications in large-scale decision-making. However, most research for this problem focuses exclusively on the setting with perfect communication, whereas in most real-world distributed settings, communication is often over stochastic networks, with arbitrary corruptions and delays. In this paper, we study cooperative bandit learning under three typical real-world communication scenarios, namely, (a) message-passing over stochastic time-varying networks, (b) instantaneous rewardsharing over a network with random delays, and (c) message-passing with adversarially corrupted rewards, including byzantine communication. For each of these environments, we propose decentralized algorithms that achieve competitive performance, along with near-optimal guarantees on the incurred group regret as well. Furthermore, in the setting with perfect communication, we present an improved delayed-update algorithm that outperforms the existing state-of-the-art on various network topologies.

artificial intelligence, data mining, machine learning, (20 more...)

Neural Information Processing Systems

Apr-25-2026, 14:38:03 GMT

Conferences PDF

Add feedback

Genre:
- Research Report (0.46)

Technology:
- Information Technology
  - Communications > Networks (0.66)
  - Data Science > Data Mining
    - Big Data (0.69)
  - Artificial Intelligence
    - Representation & Reasoning > Agents (1.00)
    - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
NeurIPS2021_ImperfectCommmunicationBandits

Similar Docs Excel Report more

Title	Similarity	Source
None found