AnImprovedAnalysisof(Variance-Reduced) Policy GradientandNaturalPolicyGradientMethods

Neural Information Processing Systems 

In this paper, we revisit and improve the convergence of policy gradient (PG), natural PG (NPG) methods, and their variance-reduced variants, under general smooth policy parametrizations.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found