On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method

Neural Information Processing Systems 

In this paper, a simple gradient truncation mechanism is proposed to address this issue.
