Distributed-Training-and-Execution Multi-Agent Reinforcement Learning for Power Control in HetNet

Xu, Kaidi, Van Huynh, Nguyen, Li, Geoffrey Ye

Dec-15-2022–arXiv.org Artificial Intelligence

In heterogeneous networks (HetNets), the overlap of small cells and the macro cell causes severe cross-tier interference. Although there exist some approaches to address this problem, they usually require global channel state information, which is hard to obtain in practice, and get the sub-optimal power allocation policy with high computational complexity. To overcome these limitations, we propose a multi-agent deep reinforcement learning (MADRL) based power control scheme for the HetNet, where each access point makes power control decisions independently based on local information. To promote cooperation among agents, we develop a penalty-based Q learning (PQL) algorithm for MADRL systems. By introducing regularization terms in the loss function, each agent tends to choose an experienced action with high reward when revisiting a state, and thus the policy updating speed slows down. In this way, an agent's policy can be learned by other agents more easily, resulting in a more efficient collaboration process. Simulation results show that our proposed PQL can learn the desired power control policy from a dynamic environment where the locations of users change episodically and outperform existing DTE MADRL algorithms. The authors are with the Department of Electrical and Electronic Engineering, Imperial College London, London SW7 2AZ, U.K. (e-mail: k.xu21@imperial.ac.uk; huynh.nguyen@imperial.ac.uk; geoffrey.li@imperial.ac.uk) In conventional cellular networks, a macro base station (BS) needs to provide access to the core network for all user devices (UDs) in the cell.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

Dec-15-2022

arXiv.org PDF

Add feedback

Country:
- Europe > United Kingdom (0.24)
- Oceania > Australia
  - New South Wales > Sydney (0.14)
- North America
  - United States
    - Massachusetts > Hampshire County
      - Amherst (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Puerto Rico > San Juan
    - San Juan (0.04)
- Asia > China
  - Shanghai > Shanghai (0.04)

Genre:
- Research Report (0.84)

Industry:
- Telecommunications (1.00)
- Education > Educational Setting (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents
    - Agent Societies (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found