Scalable Constrained Policy Optimization for Safe Multi-agent Reinforcement Learning

Mar-22-2026, 22:18:54 GMT–Neural Information Processing Systems

A challenging problem in seeking to bring multi-agent reinforcement learning (MARL) techniques into real-world applications, such as autonomous driving and drone swarms, is how to control multiple agents safely and cooperatively to accomplish tasks. Most existing safe MARL methods learn the centralized value function by introducing a global state to guide safety cooperation.

artificial intelligence, machine learning, proceedings, (4 more...)

Neural Information Processing Systems

Mar-22-2026, 22:18:54 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning (1.00)