Hierarchical Critics Assignment for Multi-agent Reinforcement Learning

Feb-10-2019–arXiv.org Machine Learning

In this paper, we investigate the use of global information to speed up the learning process and increase the cumulative rewards of multi-agent reinforcement learning (MARL) tasks. Within the actor-critic MARL, we introduce multiple cooperative critics from two levels of the hierarchy and propose a hierarchical critic-based MARL algorithm. In our approach, the agent is allowed to receive information from local and global critics in a competition task. The agent not only receives low-level details but also considers coordination from high levels to obtain global information for increasing operational performance. Here, we define multiple cooperative critics in a top-down hierarchy, called the Hierarchical Critic Assignment (HCA) framework. Our experiment, a two-player tennis competition task performed in the Unity environment, tested the HCA multi-agent framework based on the Asynchronous Advantage Actor-Critic (A3C) with Proximal Policy Optimization (PPO) algorithm. The results showed that the HCA framework outperforms the non-hierarchical critic baseline method on MARL tasks.

agent, algorithm, hca framework, (14 more...)

arXiv.org Machine Learning

Feb-10-2019

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Tasmania (0.04)
  - New South Wales > Sydney (0.04)

Genre:
- Research Report > New Finding (0.69)

Industry:
- Government > Military (0.46)
- Leisure & Entertainment (0.36)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found