Offline Estimation of Controlled Markov Chains: Minimaxity and Sample Complexity

Open in new window