A Variance Minimization Approach to Temporal-Difference Learning

Open in new window