Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization

Open in new window