Deep Reinforcement Learning with Gradient Eligibility Traces