A Learning Framework For Cooperative Collision Avoidance of UAV Swarms Leveraging Domain Knowledge
Huang, Shuangyao, Zhang, Haibo, Huang, Zhiyi
–arXiv.org Artificial Intelligence
This paper presents a multi-agent reinforcement learning (MARL) framework for cooperative collision avoidance of UA V swarms leveraging domain knowledge-driven reward. The reward is derived from knowledge in the domain of image processing, approximating contours on a two-dimensional field. By modeling obstacles as maxima on the field, collisions are inherently avoided as contours never go through peaks or intersect. Additionally, counters are smooth and energy-efficient. Our framework enables training with large swarm sizes as the agent interaction is minimized and the need for complex credit assignment schemes or observation sharing mechanisms in state-of-the-art MARL approaches are eliminated. Moreover, UA Vs obtain the ability to adapt to complex environments where contours may be nonviable or non-existent through intensive training. Extensive experiments are conducted to evaluate the performances of our framework against state-of-the-art MARL algorithms.
arXiv.org Artificial Intelligence
Jul-16-2025
- Country:
- Asia > China
- Shaanxi Province > Xi'an (0.04)
- Oceania > New Zealand (0.04)
- Asia > China
- Genre:
- Research Report (0.50)
- Industry:
- Information Technology (0.68)
- Transportation (0.73)
- Technology: