Physical Deep Reinforcement Learning Towards Safety Guarantee
Cao, Hongpeng, Mao, Yanbing, Sha, Lui, Caccamo, Marco
arXiv.org Artificial Intelligence
Deep reinforcement learning (DRL) has achieved tremendous success in many complex decision-making tasks of autonomous systems with high-dimensional state and/or action spaces. However, safety and stability remain major concerns that hinder the application of DRL to safety-critical autonomous systems. To address these concerns, we propose Phy-DRL, a physical deep reinforcement learning framework. Phy-DRL is novel in two architectural designs: i) a Lyapunov-like reward, and ii) residual control (i.e., the integration of physics-model-based control and data-driven control). Together, the physical reward and residual control endow Phy-DRL with (mathematically) provable safety and stability guarantees. Through experiments on the inverted pendulum, we show that Phy-DRL features guaranteed safety and stability and enhanced robustness, while offering remarkably accelerated training and an enlarged reward.
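The two design ideas named in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything below is an assumption made for illustration: the quadratic energy x^T P x, the reward defined as the one-step decrease of that energy, the linear feedback gain, and all function names are hypothetical and are not taken from the paper.

```python
from typing import Callable, Sequence

Vector = Sequence[float]
Matrix = Sequence[Sequence[float]]


def quadratic_form(p: Matrix, x: Vector) -> float:
    """Return x^T P x, used here as an assumed Lyapunov-like energy."""
    n = len(x)
    return sum(x[i] * p[i][j] * x[j] for i in range(n) for j in range(n))


def lyapunov_like_reward(p: Matrix, x: Vector, x_next: Vector) -> float:
    """Assumed reward shape: positive when the energy drops over one step."""
    return quadratic_form(p, x) - quadratic_form(p, x_next)


def residual_action(
    feedback_gain: Vector,
    x: Vector,
    learned_policy: Callable[[Vector], float],
) -> float:
    """Add a learned correction to a model-based linear feedback action."""
    # Model-based term: hypothetical linear state feedback, gain . x
    model_based = sum(k * xi for k, xi in zip(feedback_gain, x))
    # Data-driven term: any callable standing in for a trained DRL policy
    return model_based + learned_policy(x)
```

For example, with P set to the 2x2 identity, a step from state (1, 2) to (0.5, 1) lowers the energy and therefore earns a positive reward under this assumed definition, and a constant stand-in policy simply shifts the model-based action by its output.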
Mar-29-2023
- Country:
  - Europe
    - Germany > Bavaria
      - Upper Bavaria > Munich (0.05)
    - Romania (0.04)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
  - North America > United States
    - Illinois > Champaign County
      - Urbana (0.04)
    - Michigan > Wayne County
      - Detroit (0.04)
- Genre:
  - Research Report (0.40)
- Industry:
  - Transportation > Ground > Road (0.94)
- Technology: