Safe Model-based Reinforcement Learning with Stability Guarantees
Berkenkamp, Felix, Turchetta, Matteo, Schoellig, Angela, Krause, Andreas
–Neural Information Processing Systems
Reinforcement learning is a powerful paradigm for learning optimal policies from experimental data. However, to find optimal policies, most reinforcement learning algorithms explore all possible actions, which may be harmful for real-world systems. As a consequence, learning algorithms are rarely applied on safety-critical systems in the real world. In this paper, we present a learning algorithm that explicitly considers safety, defined in terms of stability guarantees. Specifically, we extend control-theoretic results on Lyapunov stability verification and show how to use statistical models of the dynamics to obtain high-performance control policies with provable stability certificates.
Neural Information Processing Systems
Feb-14-2020, 06:27:26 GMT
- Technology: