A Hoeffding Inequality for Finite State Markov Chains and its Applications to Markovian Bandits
