Flipping-based Policy for Chance-Constrained Markov Decision Processes

May-30-2025, 02:41:55 GMT–Neural Information Processing Systems

Safe reinforcement learning (RL) is a promising approach for many real-world decision-making problems where ensuring safety is a critical necessity. In safe RL research, while expected cumulative safety constraints (ECSCs) are typically the first choices, chance constraints are often more pragmatic for incorporating safety under uncertainties. This paper proposes a flipping-based policy for Chance-Constrained Markov Decision Processes (CCMDPs). The flipping-based policy selects the next action by tossing a potentially distorted coin between two action candidates. The probability of the flip and the two action candidates vary depending on the state.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

May-30-2025, 02:41:55 GMT

Conferences PDF

Add feedback

Country:
- North America > United States > Massachusetts > Middlesex County (0.14)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.92)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.60)
    - Reinforcement Learning (1.00)
  - Representation & Reasoning (1.00)

Duplicate Docs Excel Report

Title
Flipping-based Policy for Chance-Constrained Markov Decision Processes

Similar Docs Excel Report more

Title	Similarity	Source
None found