Best-Arm Identification in Linear Bandits

Marta Soare, Alessandro Lazaric, Remi Munos

Feb-10-2025, 00:11:18 GMT–Neural Information Processing Systems

We characterize the complexity of the problem and introduce sample allocation strategies that pull arms to identify the best arm with a fixed confidence, while minimizing the sample budget. In particular, we show the importance of exploiting the global linear structure to improve the estimate of the reward of near-optimal arms. We analyze the proposed strategies and compare their empirical performance. Finally, as a by-product of our analysis, we point out the connection to the G-optimality criterion used in optimal experimental design.

artificial intelligence, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Feb-10-2025, 00:11:18 GMT

Conferences PDF

Add feedback

Country:
- Europe > France > Hauts-de-France > Pas-de-Calais (0.04)

Genre:
- Research Report (0.67)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Data Science > Data Mining
    - Big Data (0.47)

Duplicate Docs Excel Report

Title
Best-Arm Identification in Linear Bandits
Best-Arm Identification in Linear Bandits
Best-Arm Identification in Linear Bandits

Similar Docs Excel Report more

Title	Similarity	Source
None found