Learning Powerful Policies by Using Consistent Dynamics Model