Review for NeurIPS paper: Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition

Open in new window