Reviews: Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies
–Neural Information Processing Systems
All reviews agree that the contribution is novel and strong. The rebuttal gave important answers and we all strongly defend acceptance.
Neural Information Processing Systems
Jan-22-2025, 09:43:55 GMT
- Technology: