Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes

Open in new window