Variance Reduced Policy Evaluation with Smooth Function Approximation

Open in new window