Hyperproperty-Constrained Secure Reinforcement Learning
Bonnah, Ernest, Nguyen, Luan Viet, Hoque, Khaza Anuarul
–arXiv.org Artificial Intelligence
Hyperproperties for Time Window Temporal Logic (HyperTWTL) is a domain-specific formal specification language known for its effectiveness in compactly representing security, opacity, and concurrency properties for robotics applications. This paper focuses on HyperTWTL-constrained secure reinforcement learning (SecRL). Although temporal logic-constrained safe reinforcement learning (SRL) is an evolving research problem with several existing literature, there is a significant research gap in exploring security-aware reinforcement learning (RL) using hyperproperties. Given the dynamics of an agent as a Markov Decision Process (MDP) and opacity/security constraints formalized as HyperTWTL, we propose an approach for learning security-aware optimal policies using dynamic Boltzmann softmax RL while satisfying the HyperTWTL constraints. The effectiveness and scalability of our proposed approach are demonstrated using a pick-up and delivery robotic mission case study. We also compare our results with two other baseline RL algorithms, showing that our proposed method outperforms them.
arXiv.org Artificial Intelligence
Aug-4-2025
- Country:
- Asia
- Middle East > Republic of Türkiye
- Aksaray Province > Aksaray (0.04)
- Taiwan > Taiwan Province
- Taipei (0.05)
- Middle East > Republic of Türkiye
- North America > United States
- Missouri > Boone County
- Columbia (0.14)
- New York > New York County
- New York City (0.04)
- Ohio > Montgomery County
- Dayton (0.04)
- Texas > McLennan County
- Waco (0.04)
- Missouri > Boone County
- Asia
- Genre:
- Research Report > New Finding (0.66)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: