TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control