Value Gradient weighted Model-Based Reinforcement Learning