Enhancing Safe Exploration Using Safety State Augmentation

Jan-19-2025, 02:49:00 GMT–Neural Information Processing Systems

Safe exploration is a challenging and important problem in model-free reinforcement learning (RL). Often the safety cost is sparse and unknown, which unavoidably leads to constraint violations - a phenomenon ideally to be avoided in safety-critical applications. We tackle this problem by augmenting the state-space with a safety state, which is nonnegative if and only if the constraint is satisfied. The value of this state also serves as a distance toward constraint violation, while its initial value indicates the available safety budget. This idea allows us to derive policies for scheduling the safety budget during training.

constraint, enhancing safe exploration, safety state augmentation, (3 more...)

Neural Information Processing Systems

Jan-19-2025, 02:49:00 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning (0.88)