Safe Reinforcement Learning with Natural Language Constraints

Oct-11-2024, 04:37:15 GMT–Neural Information Processing Systems

While safe reinforcement learning (RL) holds great promise for many practical applications like robotics or autonomous cars, current approaches require specifying constraints in mathematical form. Such specifications demand domain expertise, limiting the adoption of safe RL. In this paper, we propose learning to interpret natural language constraints for safe RL. To this end, we first introduce HAZARDWORLD, a new multi-task benchmark that requires an agent to optimize reward while not violating constraints specified in free-form text. We then develop an agent with a modular architecture that can interpret and adhere to such textual constraints while learning new tasks.
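
For readers unfamiliar with what "constraints in mathematical form" usually means, the following is a sketch of the standard constrained Markov decision process objective used in much of the safe RL literature. It is not taken from this page, and the symbols are assumptions for illustration: policy \pi, reward r, discount factor \gamma, cost functions c_i, and cost budgets d_i.

```latex
\[
\max_{\pi} \; \mathbb{E}_{\tau \sim \pi}\Big[\sum_{t=0}^{\infty} \gamma^{t}\, r(s_t, a_t)\Big]
\quad \text{subject to} \quad
\mathbb{E}_{\tau \sim \pi}\Big[\sum_{t=0}^{\infty} \gamma^{t}\, c_i(s_t, a_t)\Big] \le d_i,
\qquad i = 1, \dots, k.
\]
```

In this framing, a designer must hand-specify each cost function c_i and budget d_i. The abstract proposes replacing that step with a constraint given as free-form text.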

constraint, natural language constraint, safe reinforcement learning

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (0.65)
  - Machine Learning > Reinforcement Learning (0.65)
  - Robots (0.63)