Ordering-based Conditions for Global Convergence of Policy Gradient Methods

Neural Information Processing Systems 

The conditions on the representation that imply global convergence are different between these two algorithms.