Probabilistic Shielding for Safe Reinforcement Learning

Court, Edwin Hamel-De le, Belardinelli, Francesco, Goodall, Alex W.

Mar-17-2025–arXiv.org Machine Learning

In real-life scenarios, a Reinforcement Learning (RL) agent aiming to maximise their reward, must often also behave in a safe manner, including at training time. Thus, much attention in recent years has been given to Safe RL, where an agent aims to learn an optimal policy among all policies that satisfy a given safety constraint. However, strict safety guarantees are often provided through approaches based on linear programming, and thus have limited scaling. In this paper we present a new, scalable method, which enjoys strict formal guarantees for Safe RL, in the case where the safety dynamics of the Markov Decision Process (MDP) are known, and safety is defined as an undiscounted probabilistic avoidance property. Our approach is based on state-augmentation of the MDP, and on the design of a shield that restricts the actions available to the agent. We show that our approach provides a strict formal safety guarantee that the agent stays safe at training and test time. Furthermore, we demonstrate that our approach is viable in practice through experimental evaluation.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Machine Learning

Mar-17-2025

arXiv.org PDF

Add feedback

Country:
- Oceania
  - New Zealand > North Island
    - Auckland Region > Auckland (0.04)
  - Australia > New South Wales
    - Sydney (0.04)
- North America
  - Canada (0.04)
  - United States
    - Maryland > Baltimore (0.04)
    - New York
      - New York County > New York City (0.14)
      - Richmond County > New York City (0.04)
      - Queens County > New York City (0.04)
      - Kings County > New York City (0.04)
      - Bronx County > New York City (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - California > Los Angeles County
      - Los Angeles (0.14)
- Europe
  - Austria > Vienna (0.14)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - Macao (0.04)
  - China (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found