Improving Model-Based Reinforcement Learning by Converging to Flatter Minima

Open in new window