Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions

Jun-11-2026, 00:02:10 GMT–Neural Information Processing Systems

Establishing stability certificates for closed-loop systems under reinforcement learning (RL) policies is essential to move beyond empirical performance and offer guarantees of system behavior. Classical Lyapunov methods require a strict stepwise decrease in the Lyapunov function but such certificates are difficult to construct for learned policies. The RL value function is a natural candidate but it is not well understood how it can be adapted for this purpose. To gain intuition, we first study the linear quadratic regulator (LQR) problem and make two key observations. First, a Lyapunov function can be obtained from the value function of an LQR policy by augmenting it with a residual term related to the system dynamics and stage cost.

artificial intelligence, machine learning, proceedings, (9 more...)

Neural Information Processing Systems

Jun-11-2026, 00:02:10 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.79)