TDM: From model-free to model-based deep reinforcement learning