Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes

Neural Information Processing Systems 

We study minimax optimal reinforcement learning in episodic factored Markov decision processes (FMDPs), which are MDPs with conditionally independent transition components.
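To make "conditionally independent transition components" concrete, the following is a minimal sketch, not the paper's algorithm. It assumes the common factored form in which the next state splits into components and each component is drawn independently given a small subset (its scope) of the current state-action variables. The names `factored_transition`, `factors`, and `scopes`, and the toy numbers, are hypothetical and chosen only for illustration.

```python
import itertools
import math


def factored_transition(factors, scopes, state, action):
    """Return {next_state: P(next_state | state, action)} under a factored model.

    Assumption (not from the source): factors[i] maps the values of the
    variables in scopes[i] to a probability vector over component i's next
    value, and scopes[i] indexes into the tuple (state..., action).
    """
    x = tuple(state) + (action,)
    # Per-component conditional distributions P_i(. | x restricted to scope i)
    comps = [factors[i][tuple(x[j] for j in scopes[i])] for i in range(len(factors))]
    probs = {}
    for nxt in itertools.product(*(range(len(c)) for c in comps)):
        # Conditional independence: the joint is the product of the components
        probs[nxt] = math.prod(c[v] for c, v in zip(comps, nxt))
    return probs


# Toy example: two binary state components and a binary action.
# Component 0 depends on (s0, a); component 1 depends only on s1.
factors = [
    {(0, 0): [0.9, 0.1], (0, 1): [0.5, 0.5], (1, 0): [0.2, 0.8], (1, 1): [0.0, 1.0]},
    {(0,): [0.7, 0.3], (1,): [0.4, 0.6]},
]
scopes = [(0, 2), (1,)]
probs = factored_transition(factors, scopes, state=(0, 1), action=1)
```

In this toy model the full transition table would need one distribution over 4 next states for each of the 8 state-action pairs, while the factored form stores only 6 two-entry vectors. That reduction in parameters is the kind of structure a learner can exploit in an FMDP.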
