Hellinger KL-UCB based Bandit Algorithms for Markovian and i.i.d. Settings
