Reinforcement Learning with Quasi-Hyperbolic Discounting