Convex Regularization and Convergence of Policy Gradient Flows under Safety Constraints