Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies

Persson, Mika, Lidman, Jonas, Ljungberg, Jacob, Sandelius, Samuel, Andersson, Adam

Dec-11-2025–arXiv.org Artificial Intelligence

This work presents a conceptual study on the application of Multi-Agent Reinforcement Learning (MARL) for decentralized control of unmanned aerial vehicles to relay a critical data package to a known position. For this purpose, a family of deterministic games is introduced, designed for scaling studies for MARL. A robust baseline policy is proposed, which is based on restricting agent motion envelopes and applying Dijkstra's algorithm. Experimental results show that two off-the-shelf MARL algorithms perform competitively with the baseline for a small number of agents, but scalability issues arise as the number of agents increase.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

Dec-11-2025

arXiv.org PDF

Add feedback

Country:
- Europe > Sweden
  - Stockholm > Stockholm (0.04)
  - Vaestra Goetaland > Gothenburg (0.04)
- North America > United States (0.04)

Genre:
- Research Report > New Finding (0.34)

Industry:
- Aerospace & Defense (0.34)
- Information Technology (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Representation & Reasoning > Agents
    - Agent Societies (0.69)
  - Robots > Autonomous Vehicles
    - Drones (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found