Optimal Best-arm Identification in Linear Bandits
–Neural Information Processing Systems
The objective is to identify the best arm with a given level of certainty while minimizing the sampling budget.
Neural Information Processing Systems
Oct-3-2025, 05:37:47 GMT