Review for NeurIPS paper: Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss

Neural Information Processing Systems 

I want to thank the authors for preparing the detailed rebuttal. This paper was discussed among all the reviewers during the post-rebuttal discussion phase. Overall, the reviewers are excited about this work on solving constrained MDP problems and have a positive assessment of the paper. All the reviewers acknowledged the theoretical contributions, especially given the challenging setting of unknown dynamics and a non-stationary loss function. There was a clear consensus that the paper should be accepted.