A Variance-Reduced Cubic-Regularized Newton for Policy Optimization
