Mitigating Planner Overfitting in Model-Based Reinforcement Learning