A Learning Framework For Cooperative Collision Avoidance of UAV Swarms Leveraging Domain Knowledge

Huang, Shuangyao, Zhang, Haibo, Huang, Zhiyi

Jul-16-2025–arXiv.org Artificial Intelligence

This paper presents a multi-agent reinforcement learning (MARL) framework for cooperative collision avoidance of UA V swarms leveraging domain knowledge-driven reward. The reward is derived from knowledge in the domain of image processing, approximating contours on a two-dimensional field. By modeling obstacles as maxima on the field, collisions are inherently avoided as contours never go through peaks or intersect. Additionally, counters are smooth and energy-efficient. Our framework enables training with large swarm sizes as the agent interaction is minimized and the need for complex credit assignment schemes or observation sharing mechanisms in state-of-the-art MARL approaches are eliminated. Moreover, UA Vs obtain the ability to adapt to complex environments where contours may be nonviable or non-existent through intensive training. Extensive experiments are conducted to evaluate the performances of our framework against state-of-the-art MARL algorithms.

machine learning, reinforcement learning, trajectory, (18 more...)

arXiv.org Artificial Intelligence

Jul-16-2025

arXiv.org PDF

Add feedback

Country:
- Asia > China
  - Shaanxi Province > Xi'an (0.04)
- Oceania > New Zealand (0.04)

Genre:
- Research Report (0.50)

Industry:
- Information Technology (0.68)
- Transportation (0.73)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Representation & Reasoning > Agents
    - Agent Societies (0.46)
  - Robots > Autonomous Vehicles (1.00)