Explicit Best Arm Identification in Linear Bandits Using No-Regret Learners

Open in new window