An Optimal Elimination Algorithm for Learning a Best Arm Avinatan Hassidim

Jan-26-2025, 01:03:48 GMT–Neural Information Processing Systems

We consider the classic problem of (ɛ, δ)-PAC learning a best arm where the goal is to identify with confidence 1 δ an arm whose mean is an ɛ-approximation to that of the highest mean arm in a multi-armed bandit setting. This problem is one of the most fundamental problems in statistics and learning theory, yet somewhat surprisingly its worst case sample complexity is not well understood. In this paper we propose a new approach for (ɛ, δ)-PAC learning a best arm. This approach leads to an algorithm whose sample complexity converges to exactly the optimal sample complexity of (ɛ, δ)-learning the mean of n arms separately and we complement this result with a conditional matching lower bound.

artificial intelligence, machine learning, sample complexity, (17 more...)

Neural Information Processing Systems

Jan-26-2025, 01:03:48 GMT

Conferences PDF

Add feedback

Country:
- Europe (1.00)
- North America > United States (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.90)