An Optimal Elimination Algorithm for Learning a Best Arm

Hassidim, Avinatan, Kupfer, Ron, Singer, Yaron

Jun-20-2020–arXiv.org Machine Learning

We consider the classic problem of $(\epsilon,\delta)$-PAC learning a best arm where the goal is to identify with confidence $1-\delta$ an arm whose mean is an $\epsilon$-approximation to that of the highest mean arm in a multi-armed bandit setting. This problem is one of the most fundamental problems in statistics and learning theory, yet somewhat surprisingly its worst-case sample complexity is not well understood. In this paper, we propose a new approach for $(\epsilon,\delta)$-PAC learning a best arm. This approach leads to an algorithm whose sample complexity converges to \emph{exactly} the optimal sample complexity of $(\epsilon,\delta)$-learning the mean of $n$ arms separately and we complement this result with a conditional matching lower bound. More specifically:

artificial intelligence, machine learning, sample complexity, (17 more...)

arXiv.org Machine Learning

Jun-20-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - District of Columbia > Washington (0.04)
  - New Jersey > Mercer County
    - Princeton (0.04)
  - Georgia > Fulton County
    - Atlanta (0.04)
  - Arizona > Maricopa County
    - Phoenix (0.04)
- Europe
  - United Kingdom > Scotland
    - City of Edinburgh > Edinburgh (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Portugal > Porto
    - Porto (0.04)
- Asia > Middle East
  - Israel > Haifa District > Haifa (0.04)

Genre:
- Research Report (0.82)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.89)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found