Best-Arm Identification in Linear Bandits

Mar-13-2024, 14:01:36 GMT–Neural Information Processing Systems

We characterize the complexity of best-arm identification in linear bandits and introduce sample allocation strategies that pull arms to identify the best arm with a fixed confidence while minimizing the sample budget. In particular, we show the importance of exploiting the global linear structure to improve the estimate of the reward of near-optimal arms. We analyze the proposed strategies and compare their empirical performance. Finally, as a by-product of our analysis, we point out the connection to the G-optimality criterion used in optimal experimental design.
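To make the G-optimality criterion mentioned above concrete, here is a minimal Python sketch. It is not the allocation strategy proposed in the paper; it only illustrates the classical design problem of choosing arm proportions w that minimize the worst-case predictive variance max_x x^T A(w)^{-1} x, where A(w) = sum_k w_k x_k x_k^T. By the Kiefer-Wolfowitz equivalence theorem the optimal value equals the dimension d. The function name `g_optimal_design`, the Fedorov-Wynn style step size, and the three example arms are illustrative assumptions, not taken from the source.

```python
import numpy as np


def g_optimal_design(arms, n_iters=2000, tol=1e-6):
    """Approximate a G-optimal design with Frank-Wolfe style updates.

    arms: (K, d) array of arm feature vectors, assumed to span R^d.
    Returns a weight vector of shape (K,) that sums to 1.
    """
    n_arms, dim = arms.shape
    weights = np.full(n_arms, 1.0 / n_arms)  # start from the uniform allocation
    for _ in range(n_iters):
        # Design matrix A(w) = sum_k w_k x_k x_k^T
        design = arms.T @ (weights[:, None] * arms)
        inv = np.linalg.inv(design)
        # Predictive variance x_k^T A(w)^{-1} x_k of every arm
        variances = np.einsum("kd,de,ke->k", arms, inv, arms)
        worst = int(np.argmax(variances))
        g = variances[worst]
        if g <= dim + tol:  # Kiefer-Wolfowitz: the optimum satisfies max variance = d
            break
        # Fedorov-Wynn step size (exact line search for the log-det objective)
        step = (g / dim - 1.0) / (g - 1.0)
        weights = (1.0 - step) * weights
        weights[worst] += step
    return weights


def max_variance(arms, weights):
    """Worst-case predictive variance max_k x_k^T A(w)^{-1} x_k."""
    inv = np.linalg.inv(arms.T @ (weights[:, None] * arms))
    return float(np.max(np.einsum("kd,de,ke->k", arms, inv, arms)))


# Hypothetical example: two orthogonal arms and one arm between them.
arms = np.array([[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]])
weights = g_optimal_design(arms)
```

In this toy instance the design shifts mass away from the in-between arm and toward the two orthogonal arms, and `max_variance(arms, weights)` approaches d = 2. A design of this kind reduces uncertainty uniformly over all arms, which is the baseline against which strategies that focus on near-optimal arms can be compared.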

allocation, allocation strategy, complexity, (15 more...)

Country:
- Europe > France > Hauts-de-France > Pas-de-Calais (0.04)

Genre:
- Research Report (0.67)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Data Science > Data Mining
    - Big Data (0.47)