Stochastic Variance Reduction for Policy Gradient Estimation
