PARCO: Learning Parallel Autoregressive Policies for Efficient Multi-Agent Combinatorial Optimization

Berto, Federico, Hua, Chuanbo, Luttmann, Laurin, Son, Jiwoo, Park, Junyoung, Ahn, Kyuree, Kwon, Changhyun, Xie, Lin, Park, Jinkyoo

Sep-5-2024–arXiv.org Artificial Intelligence

Multi-agent combinatorial optimization problems such as routing and scheduling have great practical relevance but present challenges due to their NP-hard combinatorial nature, hard constraints on the number of possible agents, and hard-to-optimize objective functions. This paper introduces PARCO (Parallel AutoRegressive Combinatorial Optimization), a novel approach that learns fast surrogate solvers for multi-agent combinatorial problems with reinforcement learning by employing parallel autoregressive decoding. We propose a model with a Multiple Pointer Mechanism to efficiently decode multiple decisions simultaneously by different agents, enhanced by a Priority-based Conflict Handling scheme. Moreover, we design specialized Communication Layers that enable effective agent collaboration, thus enriching decision-making. We evaluate PARCO in representative multi-agent combinatorial problems in routing and scheduling and demonstrate that our learned solvers offer competitive results against both classical and neural baselines in terms of both solution quality and speed. We make our code openly available at https://github.com/ai4co/parco.
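
To make the idea of decoding several agents' decisions at once more concrete, here is a minimal sketch of a single parallel decoding step with priority-based conflict resolution. It is not the authors' implementation. The abstract does not specify how priorities are computed or what a losing agent does, so the function name `parallel_decode_step`, the `scores` and `priorities` inputs, and the fallback rule (a losing agent stays at its current node) are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence


def parallel_decode_step(
    scores: Sequence[Sequence[float]],
    priorities: Sequence[float],
    current: Sequence[int],
) -> List[int]:
    """Pick one node per agent in a single step, resolving conflicts by priority.

    scores[a][n]  : agent a's score for node n (stand-in for pointer logits).
    priorities[a] : higher value wins a conflict (hypothetical priority signal).
    current[a]    : node agent a occupies now; used as fallback (assumption).
    """
    # Every agent greedily selects its best-scoring node, independently.
    picks = [max(range(len(row)), key=row.__getitem__) for row in scores]

    # For each contested node, remember the highest-priority agent.
    # Ties go to the lower agent index because the comparison is strict.
    winner: Dict[int, int] = {}
    for agent, node in enumerate(picks):
        best = winner.get(node)
        if best is None or priorities[agent] > priorities[best]:
            winner[node] = agent

    # Winners move to their pick; losers keep their current node.
    return [
        node if winner[node] == agent else current[agent]
        for agent, node in enumerate(picks)
    ]


# Three agents, four nodes (node 0 plays the role of a shared depot).
scores = [
    [0.0, 0.9, 0.3, 0.1],  # agent 0 prefers node 1
    [0.0, 0.8, 0.5, 0.2],  # agent 1 also prefers node 1 (conflict)
    [0.0, 0.1, 0.2, 0.7],  # agent 2 prefers node 3
]
priorities = [0.5, 0.9, 0.1]
current = [0, 0, 0]
actions = parallel_decode_step(scores, priorities, current)
```

In this toy run, agents 0 and 1 both select node 1; agent 1 has the higher priority and takes it, agent 0 stays at node 0, and agent 2 moves to node 3 uncontested. A learned model would produce the scores, and possibly the priorities, from agent and node embeddings rather than from fixed lists.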

