Verification-Aware Planning for Multi-Agent Systems
Xu, Tianyang; Zhang, Dan; Mitra, Kushan; Hruschka, Estevam
arXiv.org Artificial Intelligence
Large language model (LLM) agents are increasingly deployed to tackle complex tasks, often necessitating collaboration among multiple specialized agents. However, multi-agent collaboration introduces new challenges in planning, coordination, and verification. Execution failures frequently arise not from flawed reasoning alone, but from subtle misalignments in task interpretation, output format, or inter-agent handoffs. To address these challenges, we present VeriMAP, a framework for multi-agent collaboration with verification-aware planning. The VeriMAP planner decomposes tasks, models subtask dependencies, and encodes planner-defined passing criteria as subtask verification functions (VFs) in Python and natural language. We evaluate VeriMAP on diverse datasets, demonstrating that it outperforms both single- and multi-agent baselines while enhancing system robustness and interpretability. Our analysis highlights how verification-aware planning enables reliable coordination and iterative refinement in multi-agent systems, without relying on external labels or annotations.
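To make the idea of planner-defined verification functions concrete, the following is a minimal sketch and not the authors' implementation. It assumes a plan is a set of subtasks with explicit dependencies, each carrying a Python VF that returns a boolean. The names `Subtask`, `run_plan`, `toy_executor`, and the retry policy are hypothetical and introduced only for illustration; natural-language VFs, which would need an LLM judge, are omitted.

```python
from dataclasses import dataclass, field
from graphlib import TopologicalSorter
from typing import Callable


@dataclass
class Subtask:
    """One node of a hypothetical plan (illustrative, not the paper's schema)."""
    name: str
    instruction: str
    verify: Callable[[str], bool]  # Python VF: planner-defined passing criterion
    depends_on: list[str] = field(default_factory=list)


def run_plan(subtasks, executor, max_retries=2):
    """Run subtasks in dependency order, retrying any whose VF rejects the output."""
    by_name = {t.name: t for t in subtasks}
    # TopologicalSorter takes {node: predecessors}; static_order yields predecessors first.
    order = TopologicalSorter({t.name: t.depends_on for t in subtasks}).static_order()
    outputs = {}
    for name in order:
        task = by_name[name]
        context = {dep: outputs[dep] for dep in task.depends_on}
        for attempt in range(max_retries + 1):
            result = executor(task, context, attempt)
            if task.verify(result):
                outputs[name] = result
                break
        else:
            # No attempt passed verification; surface the failure instead of passing bad output on.
            raise RuntimeError(f"subtask {name!r} failed verification")
    return outputs


def toy_executor(task, context, attempt):
    """Stand-in for an LLM agent: the first 'extract' attempt has the wrong format."""
    if task.name == "extract":
        return "42" if attempt > 0 else "forty-two"
    return f"The answer is {context['extract']}."


plan = [
    Subtask("extract", "Return the number as digits only.", verify=str.isdigit),
    Subtask(
        "report",
        "Write one sentence containing the number.",
        verify=lambda s: s.endswith(".") and "42" in s,
        depends_on=["extract"],
    ),
]

results = run_plan(plan, toy_executor)
print(results)
```

In this toy run the first `extract` attempt returns text rather than digits, so its VF rejects it and the retry succeeds before the dependent `report` subtask consumes the value. This is the kind of output-format misalignment at a handoff that the abstract describes.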
Oct-21-2025
- Country:
- Europe > Austria (0.28)
- North America > United States (0.28)
- North America > Mexico (0.28)
- Genre:
- Research Report (1.00)
- Workflow (0.68)
- Technology: