Learning Control Policies for Stochastic Systems with Reach-avoid Guarantees

Žikelić, Đorđe, Lechner, Mathias, Henzinger, Thomas A., Chatterjee, Krishnendu

Nov-29-2022–arXiv.org Artificial Intelligence

We study the problem of learning controllers for discrete-time non-linear stochastic dynamical systems with formal reach-avoid guarantees. This work presents the first method for providing formal reach-avoid guarantees, which combine and generalize stability and safety guarantees, with a tolerable probability threshold $p\in[0,1]$ over the infinite time horizon. Our method leverages advances in machine learning literature and it represents formal certificates as neural networks. In particular, we learn a certificate in the form of a reach-avoid supermartingale (RASM), a novel notion that we introduce in this work. Our RASMs provide reachability and avoidance guarantees by imposing constraints on what can be viewed as a stochastic extension of level sets of Lyapunov functions for deterministic systems. Our approach solves several important problems -- it can be used to learn a control policy from scratch, to verify a reach-avoid specification for a fixed control policy, or to fine-tune a pre-trained policy if it does not satisfy the reach-avoid specification. We validate our approach on $3$ stochastic non-linear reinforcement learning tasks.

artificial intelligence, machine learning, rasm, (17 more...)

arXiv.org Artificial Intelligence

Nov-29-2022

arXiv.org PDF

Add feedback

Country:
- Asia (0.67)
- Europe (1.00)
- North America > United States
  - Massachusetts (0.28)

Genre:
- Research Report (0.63)

Industry:
- Education (0.34)
- Government (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.89)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found