SABLAS: Learning Safe Control for Black-box Dynamical Systems

Jan-8-2022–arXiv.org Artificial Intelligence

Control certificates based on barrier functions have been a powerful tool to generate probably safe control policies for dynamical systems. However, existing methods based on barrier certificates are normally for white-box systems with differentiable dynamics, which makes them inapplicable to many practical applications where the system is a black-box and cannot be accurately modeled. On the other side, model-free reinforcement learning (RL) methods for black-box systems suffer from lack of safety guarantees and low sampling efficiency. In this paper, we propose a novel method that can learn safe control policies and barrier certificates for black-box dynamical systems, without requiring for an accurate system model. Our method re-designs the loss function to back-propagate gradient to the control policy even when the black-box dynamical system is non-differentiable, and we show that the safety certificates hold on the black-box system. Empirical results in simulation show that our method can significantly improve the performance of the learned policies by achieving nearly 100% safety and goal reaching rates using much fewer training samples, compared to state-of-the-art black-box safe control methods. Our learned agents can also generalize to unseen scenarios while keeping the original performance. The source code can be found at https://github.com/Zengyi-Qin/bcbf.

certificate, controller, dynamical system, (12 more...)

arXiv.org Artificial Intelligence

Jan-8-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Illinois (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.14)
- Europe > Denmark
  - North Jutland > Aalborg (0.04)
- Asia
  - Singapore (0.14)
  - Middle East > Jordan (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)

Genre:
- Research Report > Promising Solution (0.34)

Industry:
- Transportation > Air (1.00)

Technology:
- Information Technology
  - Scientific Computing (1.00)
  - Artificial Intelligence
    - Robots (1.00)
    - Representation & Reasoning > Agents (0.68)
    - Machine Learning
      - Neural Networks (1.00)
      - Statistical Learning (0.68)