Explainably Safe Reinforcement Learning
–Neural Information Processing Systems
Trust in a decision-making system requires both safety guarantees and the ability to interpret and understand its behavior. This is particularly important for learned systems, whose decision-making processes are often highly opaque. Shielding is a prominent model-based technique for enforcing safety in reinforcement learning. However, because shields are automatically synthesized using rigorous formal methods, their decisions are often similarly difficult for humans to interpret. Recently, decision trees became customary to represent controllers and policies.
Neural Information Processing Systems
Jun-23-2026, 00:38:54 GMT
- Genre:
- Research Report > Experimental Study (1.00)
- Overview (0.67)
- Industry:
- Transportation (0.68)
- Government (0.67)
- Information Technology (0.67)
- Automobiles & Trucks (0.46)
- Technology: