Multi-Agent First Order Constrained Optimization in Policy Space

Jan-19-2025, 09:58:31 GMT–Neural Information Processing Systems

In the realm of multi-agent reinforcement learning (MARL), achieving high performance is crucial for a successful multi-agent system.Meanwhile, the ability to avoid unsafe actions is becoming an urgent and imperative problem to solve for real-life applications. Whereas, it is still challenging to develop a safety-aware method for multi-agent systems in MARL. In this work, we introduce a novel approach called Multi-Agent First Order Constrained Optimization in Policy Space (MAFOCOPS), which effectively addresses the dual objectives of attaining satisfactory performance and enforcing safety constraints. Using data generated from the current policy, MAFOCOPS first finds the optimal update policy by solving a constrained optimization problem in the nonparameterized policy space. Then, the update policy is projected back into the parametric policy space to achieve a feasible policy.

multi-agent system, order constrained optimization, policy space, (3 more...)

Neural Information Processing Systems

Jan-19-2025, 09:58:31 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report (0.43)

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)