Offline Estimation of Controlled Markov Chains: Minimaxity and Sample Complexity