Fixed Confidence Best Arm Identification in the Bayesian Setting
–Neural Information Processing Systems
We consider the fixed-confidence best arm identification (FC-BAI) problem in the Bayesian setting. This problem aims to find the arm of the largest mean with a fixed confidence level when the bandit model has been sampled from the known prior. Most studies on the FC-BAI problem have been conducted in the frequentist setting, where the bandit model is predetermined before the game starts. We show that the traditional FC-BAI algorithms studied in the frequentist setting, such as trackand-stop and top-two algorithms, result in arbitrarily suboptimal performances in the Bayesian setting. We also obtain a lower bound of the expected number of samples in the Bayesian setting and introduce a variant of successive elimination that has a matching performance with the lower bound up to a logarithmic factor. Simulations verify the theoretical results.
Neural Information Processing Systems
May-28-2025, 17:36:56 GMT
- Country:
- Asia > Middle East
- Israel (0.14)
- North America > United States
- Asia > Middle East
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Education (0.45)
- Leisure & Entertainment > Games (0.34)
- Technology: