A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit

Burtini, Giuseppe, Loeppky, Jason, Lawrence, Ramon

Nov-3-2015–arXiv.org Machine Learning

Adaptive and sequential experiment design is a well-studied area in numerous domains. We survey and synthesize the work of the online statistical learning paradigm referred to as multi-armed bandits integrating the existing research as a resource for a certain class of online experiments. We first explore the traditional stochastic model of a multi-armed bandit, then explore a taxonomic scheme of complications to that model, for each complication relating it to a specific requirement or consideration of the experiment design context. Finally, at the end of the paper, we present a table of known upper-bounds of regret for all studied algorithms providing both perspectives for future theoretical work and a decision-making tool for practitioners looking for theoretical guarantees.

artificial intelligence, data mining, machine learning, (18 more...)

arXiv.org Machine Learning

Nov-3-2015

arXiv.org PDF

Add feedback

Country:
- North America > Canada (0.28)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.68)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.71)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Uncertainty
      - Bayesian Inference (0.46)
    - Machine Learning
      - Statistical Learning (1.00)
      - Performance Analysis > Accuracy (0.67)
      - Learning Graphical Models > Directed Networks
        Bayesian Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found