Reviews: Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies

Neural Information Processing Systems 

All reviews agree that the contribution is novel and strong. The rebuttal gave important answers and we all strongly defend acceptance.