The regret lower bound for communicating Markov Decision Processes

Open in new window