Non-monotonic Value Function Factorization for Deep Multi-Agent Reinforcement Learning

Apr-18-2021–arXiv.org Artificial Intelligence

In this paper, we propose actor-critic approaches by introducing an actor policy on QMIX [9], which can remove the monotonicity constraint of QMIX and implement a non-monotonic value function factorization for joint action-value. We evaluate our actor-critic methods on StarCraft II micromanagement tasks, and show that it has a stronger performance on maps with heterogeneous agent types.

agent, deep multi-agent reinforcement learning, monotonicity constraint, (10 more...)

arXiv.org Artificial Intelligence

Apr-18-2021

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.50)

Industry:
- Leisure & Entertainment (0.37)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found