Multi-Agent Proximal Policy Optimization For Data Freshness in UAV-assisted Networks

Ndiaye, Mouhamed Naby, Bergou, El Houcine, Hammouti, Hajar El

Mar-15-2023–arXiv.org Artificial Intelligence

Unmanned aerial vehicles (UAVs) are seen as a promising technology for a wide range of tasks in wireless communication networks. In this work, we consider the deployment of a group of UAVs to collect the data generated by IoT devices. Specifically, we focus on the case where the collected data is time-sensitive and maintaining its timeliness is critical. Our objective is to optimally design the UAVs' trajectories and the subsets of visited IoT devices such that the global Age-of-Updates (AoU) is minimized. To this end, we formulate the studied problem as a mixed-integer nonlinear program (MINLP) under time and quality-of-service constraints. To efficiently solve the resulting optimization problem, we investigate the cooperative Multi-Agent Reinforcement Learning (MARL) framework and propose an approach based on the popular on-policy Reinforcement Learning (RL) algorithm Proximal Policy Optimization (PPO). Our approach leverages the centralized training and decentralized execution (CTDE) framework, in which the UAVs learn their optimal policies while training a centralized value function. Our simulation results show that the proposed multi-agent PPO (MAPPO) approach reduces the global AoU by at least half compared to conventional off-policy reinforcement learning approaches.
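For readers unfamiliar with PPO, the following is a minimal sketch of the standard clipped surrogate objective that PPO maximizes. It is not the authors' implementation: the function name `ppo_clipped_objective`, its parameters, and the default clipping value of 0.2 are assumptions chosen for illustration. In a CTDE setting such as the one the abstract describes, each UAV would plausibly compute the probability ratios from its own policy, while the advantages would be estimated with the shared centralized value function.

```python
from typing import Sequence


def ppo_clipped_objective(
    ratios: Sequence[float],
    advantages: Sequence[float],
    clip_eps: float = 0.2,  # assumed value; the abstract does not state it
) -> float:
    """Mean PPO clipped surrogate objective (a quantity to maximize).

    ratios[i]     = pi_new(a_i | o_i) / pi_old(a_i | o_i) for sample i
    advantages[i] = advantage estimate for sample i
    """
    if len(ratios) != len(advantages):
        raise ValueError("ratios and advantages must have the same length")
    if not ratios:
        raise ValueError("need at least one sample")

    total = 0.0
    for ratio, adv in zip(ratios, advantages):
        # Clip the ratio to [1 - eps, 1 + eps] so one update cannot move
        # the policy too far from the one that collected the data.
        clipped = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Take the pessimistic (smaller) of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(ratios)


if __name__ == "__main__":
    # One sample with a positive advantage, one with a negative advantage.
    print(ppo_clipped_objective([1.5, 0.5], [1.0, -1.0]))
```

When the new and old policies agree (ratio equal to 1), the objective reduces to the mean advantage; the clipping only takes effect once the ratio leaves the interval around 1.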

machine learning, reinforcement learning, UAV

Country:
- Africa > Middle East > Morocco (0.04)

Genre:
- Research Report (0.70)

Industry:
- Information Technology (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Robots > Autonomous Vehicles
    - Drones (1.00)
  - Representation & Reasoning
    - Optimization (1.00)
    - Agents (1.00)