Regret Bound Balancing and Elimination for Model Selection in Bandits and RL
