Asymptotically optimal regret in communicating Markov decision processes