New Versions of Gradient Temporal Difference Learning