Provably Safe Reinforcement Learning with Step-wise Violation Constraints

Feb-16-2026, 10:54:35 GMT–Neural Information Processing Systems

We name this problem Safe-RL-SW . Our step-wise violation constraint differs from prior expected violation constraint (Wachi & Sui, 2020; Efroni et al., 2020b; Kalagarla et al., 2021) in two aspects: (i) Minimizing the step-wise violation enables the agent to learn an optimal policy that avoids unsafe regions deterministically,

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Feb-16-2026, 10:54:35 GMT

Conferences PDF

Add feedback

Country:
- Asia > China (0.04)
- North America > United States
  - Illinois (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Machine Learning > Reinforcement Learning (0.67)
  - Representation & Reasoning > Constraint-Based Reasoning (0.46)

Duplicate Docs Excel Report

Title
aa3e67220ca4cd50010165c950fc8056-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found