Hierarchical Reinforcement Learning for Swarm Confrontation with High Uncertainty

Wu, Qizhen, Liu, Kexin, Chen, Lei, Lü, Jinhu

Jun-12-2024–arXiv.org Artificial Intelligence

In swarm robotics, confrontation including the pursuit-evasion game is a key scenario. High uncertainty caused by unknown opponents' strategies and dynamic obstacles complicates the action space into a hybrid decision process. Although the deep reinforcement learning method is significant for swarm confrontation since it can handle various sizes, as an end-to-end implementation, it cannot deal with the hybrid process. Here, we propose a novel hierarchical reinforcement learning approach consisting of a target allocation layer, a path planning layer, and the underlying dynamic interaction mechanism between the two layers, which indicates the quantified uncertainty. It decouples the hybrid process into discrete allocation and continuous planning layers, with a probabilistic ensemble model to quantify the uncertainty and regulate the interaction frequency adaptively. Furthermore, to overcome the unstable training process introduced by the two layers, we design an integration training method including pre-training and cross-training, which enhances the training efficiency and stability. Experiment results in both comparison and ablation studies validate the effectiveness and generalization performance of our proposed approach.

confrontation, pursuer, swarm confrontation, (14 more...)

arXiv.org Artificial Intelligence

Jun-12-2024

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America > United States
  - New Jersey > Mercer County > Princeton (0.04)
- Asia
  - Japan > Honshū
    - Chūgoku > Okayama Prefecture > Okayama (0.04)
  - China
    - Beijing > Beijing (0.05)
    - Shandong Province > Jinan (0.04)
    - Jiangsu Province > Nanjing (0.04)
    - Guangdong Province > Guangzhou (0.04)

Genre:
- Research Report > New Finding (0.67)

Industry:
- Government > Military (0.68)
- Leisure & Entertainment > Games (0.49)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found