EmbodiedAgent: A Scalable Hierarchical Approach to Overcome Practical Challenge in Multi-Robot Control

Wan, Hanwen, Chen, Yifei, Deng, Yixuan, Wei, Zeyu, Li, Dongrui, Lin, Zexin, Wu, Donghao, Cheng, Jiu, Ji, Xiaoqiang

Aug-18-2025–arXiv.org Artificial Intelligence

In response to these limitations, this work introduces a hierarchical Embodied system with an Agent-based planner, named EmbodiedAgent. EmbodiedAgent leverages a next-action prediction paradigm to establish a heterogeneous multi-robot control system. The core agent generates a single action and its corresponding arguments per inference, terminating upon receiving an end-of-planning signal, thus ensuring a controlled and concise execution process. To address the aforementioned challenges, we enhance the planner's robustness and generalizability through supervised fine-tuning. Extended from previous work MultiPlan [4], we present MultiPlan+, a large-scale dataset comprising 100 scenarios with over 18,000 tasks, enriched with a subset of impractical cases to mitigate hallucinations. Additionally, we develop an agent based on a fine-tuned language model equipped with function calling capabilities and structured memory. Specifically, robot skills, termination signals, and error signals related to impractical cases are encapsulated as tools, while planning history is organized within the structured memory. For low-level execution, we employ specialized policies trained on individual basic tasks to ensure reliable and robust performance. Furthermore, we propose a comprehensive R obot Planning A ssessment S chema ( RPAS), which moves beyond error-type diagnostics to emphasize stratified success rates assessed through both human evaluation and automated grading.

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

Aug-18-2025

arXiv.org PDF

Add feedback

Country:
- Asia > China (0.29)

Genre:
- Research Report (0.50)
- Workflow (0.47)

Industry:
- Education > Educational Technology > Educational Software > Computer-Aided Assessment (0.54)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Representation & Reasoning > Agents (1.00)
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.70)