Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics

Ackermann, Johannes, Gabler, Volker, Osa, Takayuki, Sugiyama, Masashi

arXiv.org Artificial Intelligence 

Many real-world tasks require multiple agents to work together. Multi-agent reinforcement learning (RL) methods have been proposed in recent years to solve these tasks, but current methods often learn policies inefficiently. We therefore investigate whether a common weakness of single-agent RL, value-function overestimation bias, also arises in the multi-agent setting. Based on our findings, we propose an approach that reduces this bias by using double centralized critics. We evaluate it on six mixed cooperative-competitive tasks, showing a significant advantage over current methods. Finally, we investigate the application of multi-agent methods to high-dimensional robotic tasks and show that our approach can be used to learn decentralized policies in this domain.
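
The core mechanism described in the abstract, two centralized critics whose minimum forms the bootstrap target, can be sketched as follows. This is a minimal illustration under assumed PyTorch conventions, not the authors' reference implementation; the class and function names (CentralizedCritic, double_critic_target) and all dimensions are hypothetical. Each critic conditions on the joint observations and joint actions of all agents, and the TD target takes the minimum over the two target critics to curb overestimation.

    # Minimal sketch (assumed PyTorch API) of a clipped double-Q target
    # with centralized critics; illustrative only, not the paper's code.
    import torch
    import torch.nn as nn

    class CentralizedCritic(nn.Module):
        """Q-network over the joint observation and joint action of all agents."""
        def __init__(self, joint_obs_dim: int, joint_act_dim: int, hidden: int = 64):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(joint_obs_dim + joint_act_dim, hidden), nn.ReLU(),
                nn.Linear(hidden, hidden), nn.ReLU(),
                nn.Linear(hidden, 1),
            )

        def forward(self, joint_obs, joint_act):
            # Centralized critic: concatenate all agents' observations and actions.
            return self.net(torch.cat([joint_obs, joint_act], dim=-1))

    def double_critic_target(target_q1, target_q2, reward, done,
                             next_joint_obs, next_joint_act, gamma=0.99):
        """Clipped double-Q target: bootstrap from the minimum of two
        target critics to reduce overestimation bias."""
        with torch.no_grad():
            q1 = target_q1(next_joint_obs, next_joint_act)
            q2 = target_q2(next_joint_obs, next_joint_act)
            min_q = torch.min(q1, q2)  # pessimistic estimate of the next-state value
            return reward + gamma * (1.0 - done) * min_q

Taking the minimum of two independently initialized critics trades a small underestimation bias for the removal of a larger overestimation bias, the same trade-off that TD3 exploits in the single-agent setting, here applied to centralized critics trained on joint information.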
