Ordering-based Conditions for Global Convergence of Policy Gradient Methods

Neural Information Processing Systems 

The conditions on the representation that imply global convergence are different between these two algorithms.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found