Learning Safety Constraints From Demonstration Using One-Class Decision Trees

Mattijs Baert, Sam Leroux, Pieter Simoens

Dec-14-2023–arXiv.org Artificial Intelligence

The alignment of autonomous agents with human values is a pivotal challenge when deploying these agents within physical environments, where safety is an important concern. However, defining the agent's objective as a reward and/or cost function is inherently complex and prone to human errors. In response to this challenge, we present a novel approach that leverages one-class decision trees to facilitate learning from expert demonstrations. These decision trees provide a foundation for representing a set of constraints pertinent to the given environment as a logical formula in disjunctive normal form. The learned constraints are subsequently employed within an oracle constrained reinforcement learning framework, enabling the acquisition of a safe policy. In contrast to other methods, our approach offers an interpretable representation of the constraints, a vital feature in safety-critical environments. To validate the effectiveness of our proposed method, we conduct experiments in synthetic benchmark domains and a realistic driving environment.
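As an illustration of how a decision tree can be read as a logical formula in disjunctive normal form, the sketch below walks a small hand-built tree and collects one conjunction per root-to-leaf path. It is not the authors' learning algorithm: the tree, the feature names `speed` and `distance`, the thresholds, and the helpers `to_dnf`, `satisfies`, and `format_dnf` are all hypothetical, and the tree is written by hand rather than learned from demonstrations.

```python
from dataclasses import dataclass
from typing import Union


@dataclass
class Leaf:
    # Assumption: True means the leaf region is covered by demonstrations.
    inside: bool


@dataclass
class Split:
    feature: str
    threshold: float
    left: "Node"   # branch taken when feature <= threshold
    right: "Node"  # branch taken when feature > threshold


Node = Union[Leaf, Split]


def to_dnf(node, path=()):
    """Return a list of conjunctions, one per leaf marked `inside`."""
    if isinstance(node, Leaf):
        return [list(path)] if node.inside else []
    left = to_dnf(node.left, path + ((node.feature, "<=", node.threshold),))
    right = to_dnf(node.right, path + ((node.feature, ">", node.threshold),))
    return left + right


def satisfies(dnf, x):
    """True if at least one conjunction holds for the feature dict `x`."""
    def holds(literal):
        feature, op, threshold = literal
        return x[feature] <= threshold if op == "<=" else x[feature] > threshold
    return any(all(holds(lit) for lit in clause) for clause in dnf)


def format_dnf(dnf):
    """Render the formula as text, e.g. '(a <= 1.0) OR (a > 1.0 AND b > 2.0)'."""
    clauses = [" AND ".join(f"{f} {op} {t}" for f, op, t in c) for c in dnf]
    return " OR ".join("(" + c + ")" for c in clauses)


# Hypothetical tree: slow driving is always covered by demonstrations,
# fast driving only when the distance to the vehicle ahead is large.
tree = Split("speed", 30.0,
             Leaf(True),
             Split("distance", 10.0, Leaf(False), Leaf(True)))

dnf = to_dnf(tree)
print(format_dnf(dnf))
print(satisfies(dnf, {"speed": 40.0, "distance": 5.0}))
```

Under these assumptions, a state that satisfies none of the conjunctions lies outside the region seen in demonstrations, which is the kind of check a constrained reinforcement learning loop could query. Because each clause is a short list of threshold tests, the resulting formula stays human-readable.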

constraint, machine learning, reinforcement learning

Country:
- Africa
  - Ethiopia > Addis Ababa
    - Addis Ababa (0.04)
  - Rwanda > Kigali
    - Kigali (0.04)
- Asia > Middle East
  - Israel > Haifa District > Haifa (0.04)
- Europe > Belgium
  - Flanders > East Flanders > Ghent (0.04)
- North America > United States
  - Illinois > Cook County > Chicago (0.04)

Genre:
- Research Report (0.84)

Industry:
- Transportation (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Decision Tree Learning (0.81)
    - Reinforcement Learning (0.69)
    - Supervised Learning (0.68)
  - Representation & Reasoning
    - Agents (0.88)
    - Constraint-Based Reasoning (0.68)
    - Diagnosis (0.81)