Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening

Open in new window