Shield Decentralization for Safe Multi-Agent Reinforcement Learning

Oct-11-2024, 03:22:19 GMT–Neural Information Processing Systems

Learning safe solutions is an important but challenging problem in multi-agent reinforcement learning (MARL). Shielded reinforcement learning is one approach for preventing agents from choosing unsafe actions. Current shielded reinforcement learning methods for MARL make strong assumptions about communication and full observability. In this work, we extend the formalization of the shielded reinforcement learning problem to a decentralized multi-agent setting. We then present an algorithm for decomposition of a centralized shield, allowing shields to be used in such decentralized, communication-free environments.
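To make the idea of splitting a centralized shield concrete, here is a minimal sketch in Python. It is not the algorithm from the paper, which the abstract does not spell out. It assumes a single state, a centralized shield given as a set of safe joint actions, and a simple greedy heuristic. The names `is_safe_decomposition` and `greedy_decompose` are hypothetical. The property illustrated is that, without communication, every combination of individually permitted actions must itself be jointly safe.

```python
from itertools import product


def is_safe_decomposition(safe_joint, per_agent_allowed):
    """Return True if every combination of individually allowed
    actions is contained in the centralized set of safe joint actions."""
    return all(joint in safe_joint for joint in product(*per_agent_allowed))


def greedy_decompose(safe_joint, agent_actions):
    """Greedy heuristic (an assumption for illustration, not the paper's
    method): grow each agent's allowed set one action at a time, keeping
    the product of all allowed sets inside safe_joint."""
    if not safe_joint:
        return None  # no safe joint action exists in this state
    # Start from one safe joint action so the product is safe initially.
    seed = sorted(safe_joint)[0]
    allowed = [{action} for action in seed]
    for i, actions in enumerate(agent_actions):
        for action in actions:
            candidate = [set(s) for s in allowed]
            candidate[i].add(action)
            # Accept the action only if all combinations remain safe.
            if is_safe_decomposition(safe_joint, candidate):
                allowed = candidate
    return allowed


# Two agents that may each "stay" or "move"; moving together is unsafe.
agent_actions = [["stay", "move"], ["stay", "move"]]
safe_joint = {("stay", "stay"), ("stay", "move"), ("move", "stay")}

local_shields = greedy_decompose(safe_joint, agent_actions)
print(local_shields)
```

In this toy case one agent keeps both actions while the other is restricted to "stay". This shows the price of removing communication: some safe joint actions, here ("stay", "move"), are ruled out so that each agent can act on its own.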

safe multi-agent reinforcement learning, shield decentralization

Genre:
- Play > Prospect > Charge (1.00)

Industry:
- Education > Focused Education > Special Education (0.31)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)