Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes Yi Tian
–Neural Information Processing Systems
We study minimax optimal reinforcement learning in episodic factored Markov decision processes (FMDPs), which are MDPs with conditionally independent transition components.
Neural Information Processing Systems
Feb-10-2026, 21:32:26 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America
- Canada (0.04)
- United States > Massachusetts
- Middlesex County > Cambridge (0.14)
- Asia > Middle East