HypRL: Reinforcement Learning of Control Policies for Hyperproperties
–Neural Information Processing Systems
Reward shaping in multi-agent reinforcement learning (MARL) for complex tasks remains a significant challenge. Existing approaches often fail to find optimal solutions or cannot efficiently handle such tasks. We propose HypRL, a specification-guided reinforcement learning framework that learns control policies w.r.t.
Neural Information Processing Systems
Jun-12-2026, 01:57:17 GMT
- Technology: