Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes
Yi Tian
