Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction

Open in new window