Flipping-based Policy for Chance-Constrained Markov Decision Processes

Neural Information Processing Systems 

Safe reinforcement learning (RL) is a promising approach for many real-world decision-making problems where ensuring safety is a critical necessity.