Federated Linear Contextual Bandits

Huang, Ruiquan, Wu, Weiqiang, Yang, Jing, Shen, Cong

Oct-27-2021–arXiv.org Machine Learning

This paper presents a novel federated linear contextual bandits model, where individual clients face different $K$-armed stochastic bandits coupled through common global parameters. By leveraging the geometric structure of the linear rewards, a collaborative algorithm called Fed-PE is proposed to cope with the heterogeneity across clients without exchanging local feature vectors or raw data. Fed-PE relies on a novel multi-client G-optimal design, and achieves near-optimal regrets for both disjoint and shared parameter cases with logarithmic communication costs. In addition, a new concept called collinearly-dependent policies is introduced, based on which a tight minimax regret lower bound for the disjoint parameter case is derived. Experiments demonstrate the effectiveness of the proposed algorithms on both synthetic and real-world datasets.

algorithm, bandit, span, (17 more...)

arXiv.org Machine Learning

Oct-27-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Virginia (0.04)
  - Pennsylvania (0.04)
  - Florida > Broward County
    - Fort Lauderdale (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Italy > Sicily
    - Palermo (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - China (0.04)

Genre:
- Research Report > New Finding (0.45)

Industry:
- Education (1.00)
- Information Technology > Security & Privacy (0.93)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Communications (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Personal Assistant Systems (0.92)
    - Machine Learning > Statistical Learning (0.66)