Reinforcement Learning with a Terminator Guy T ennenholtz