Reviews: Convergent Policy Optimization for Safe Reinforcement Learning

Neural Information Processing Systems 

The reviewers found the problem addressed in this paper interesting. While they had some concerns regarding the overlap with prior work, these concerns were mostly addressed in the rebuttal, and some reviewers raised their scores as a result.