Learning Control Policies for Stochastic Systems with Reach-avoid Guarantees
Žikelić, Đorđe, Lechner, Mathias, Henzinger, Thomas A., Chatterjee, Krishnendu
–arXiv.org Artificial Intelligence
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical systems with formal reach-avoid guarantees. This work presents the first method for providing formal reach-avoid guarantees, which combine and generalize stability and safety guarantees, with a tolerable probability threshold $p\in[0,1]$ over the infinite time horizon. Our method leverages advances in machine learning literature and it represents formal certificates as neural networks. In particular, we learn a certificate in the form of a reach-avoid supermartingale (RASM), a novel notion that we introduce in this work. Our RASMs provide reachability and avoidance guarantees by imposing constraints on what can be viewed as a stochastic extension of level sets of Lyapunov functions for deterministic systems. Our approach solves several important problems -- it can be used to learn a control policy from scratch, to verify a reach-avoid specification for a fixed control policy, or to fine-tune a pre-trained policy if it does not satisfy the reach-avoid specification. We validate our approach on $3$ stochastic non-linear reinforcement learning tasks.
arXiv.org Artificial Intelligence
Nov-29-2022
- Country:
- Oceania
- New Zealand > North Island
- Auckland Region > Auckland (0.04)
- Australia > New South Wales
- Sydney (0.04)
- New Zealand > North Island
- North America
- United States
- Massachusetts > Middlesex County
- Cambridge (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- California > Los Angeles County
- Los Angeles (0.04)
- Massachusetts > Middlesex County
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.04)
- United States
- Europe
- Austria (0.04)
- Germany > Berlin (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Cambridgeshire > Cambridge (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Russia > Northwestern Federal District
- Leningrad Oblast > Saint Petersburg (0.04)
- Italy > Campania
- Naples (0.04)
- France > Île-de-France
- Asia
- Russia (0.04)
- Middle East > Israel
- Haifa District > Haifa (0.04)
- China > Shaanxi Province
- Xi'an (0.04)
- Oceania
- Genre:
- Research Report (0.63)
- Industry:
- Government (0.46)
- Education (0.34)
- Technology: