Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds

Open in new window