Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies
Persson, Mika, Lidman, Jonas, Ljungberg, Jacob, Sandelius, Samuel, Andersson, Adam
–arXiv.org Artificial Intelligence
This work presents a conceptual study on the application of Multi-Agent Reinforcement Learning (MARL) for decentralized control of unmanned aerial vehicles to relay a critical data package to a known position. For this purpose, a family of deterministic games is introduced, designed for scaling studies for MARL. A robust baseline policy is proposed, which is based on restricting agent motion envelopes and applying Dijkstra's algorithm. Experimental results show that two off-the-shelf MARL algorithms perform competitively with the baseline for a small number of agents, but scalability issues arise as the number of agents increase.
arXiv.org Artificial Intelligence
Dec-11-2025
- Country:
- Europe > Sweden
- Stockholm > Stockholm (0.04)
- Vaestra Goetaland > Gothenburg (0.04)
- North America > United States (0.04)
- Europe > Sweden
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Aerospace & Defense (0.34)
- Information Technology (0.48)
- Technology: