Convex Regularization and Convergence of Policy Gradient Flows under Safety Constraints
