Simplifying Deep Temporal Difference Learning