Pure Exploration with Multiple Correct Answers

Feb-9-2019–arXiv.org Machine Learning

We determine the sample complexity of pure exploration bandit problems with multiple good answers. We derive a lower bound using a new game equilibrium argument. We show how continuity and convexity properties of single-answer problems ensures that the Track-and-Stop algorithm has asymptotically optimal sample complexity. However, that convexity is lost when going to the multiple-answer setting. We present a new algorithm which extends Track-and-Stop to the multiple-answer case and has asymptotic sample complexity matching the lower bound.

algorithm, pure exploration, track-and-stop, (13 more...)

arXiv.org Machine Learning

Feb-9-2019

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
- North America > United States
  - New York
    - New York County > New York City (0.14)
    - Richmond County > New York City (0.04)
    - Queens County > New York City (0.04)
    - Kings County > New York City (0.04)
    - Bronx County > New York City (0.04)
  - Florida > Broward County
    - Fort Lauderdale (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Spain > Andalusia
    - Cádiz Province > Cadiz (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)
- Asia > Japan
  - Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)

Genre:
- Research Report (0.40)

Technology:
- Information Technology
  - Game Theory (0.67)
  - Data Science > Data Mining
    - Big Data (0.49)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found