Thompson Sampling for Stochastic Bandits with Graph Feedback

Tossou, Aristide C. Y. (Chalmers University of Technology) | Dimitrakakis, Christos (University of Lille, and Chalmers University of Technology) | Dubhashi, Devdatt (Chalmers University of Technology)

Feb-14-2017–AAAI Conferences

We present a simple set of algorithms based on Thompson Sampling for stochastic bandit problems with graph feedback. Thompson Sampling is generally applicable, without the need to construct complicated upper confidence bounds. As we show in this paper, it has excellent performance in problems with graph feedback, even when the graph structure itself is unknown and/or changing. We provide theoretical guarantees on the Bayesian regret of the algorithm, as well as extensive experi- mental results on real and simulated networks. More specifically, we tested our algorithms on power law, planted partitions and Erdo's–Rényi graphs, as well as on graphs derived from Facebook and Flixster data and show that they clearly outperform related methods that employ upper confidence bounds.

artificial intelligence, big data, graph, (15 more...)

AAAI Conferences

Feb-14-2017

Conferences PDF

Add feedback

Country:
- Europe
  - France (0.14)
  - Sweden (0.14)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning (1.00)
  - Communications > Social Media (0.70)
  - Data Science > Data Mining
    - Big Data (0.49)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found