Explainably Safe Reinforcement Learning

Jun-23-2026, 00:38:54 GMT–Neural Information Processing Systems

Trust in a decision-making system requires both safety guarantees and the ability to interpret and understand its behavior. This is particularly important for learned systems, whose decision-making processes are often highly opaque. Shielding is a prominent model-based technique for enforcing safety in reinforcement learning. However, because shields are automatically synthesized using rigorous formal methods, their decisions are often similarly difficult for humans to interpret. Recently, decision trees became customary to represent controllers and policies.

machine learning, reinforcement learning, shield, (18 more...)

Neural Information Processing Systems

Jun-23-2026, 00:38:54 GMT

Conferences PDF

Add feedback

Country:
- Europe > Austria (0.28)

Genre:
- Research Report > Experimental Study (1.00)
- Overview (0.67)

Industry:
- Transportation (0.68)
- Government (0.67)
- Information Technology (0.67)
- Automobiles & Trucks (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found