Gradient Temporal-Difference Learning with Regularized Corrections
