Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening