[ICML 2021 Spotlight] DFAC Framework: Factorizing the Value Function via Quantile Mixture for…

Sep-26-2021, 07:05:10 GMT–#artificialintelligence

In multi-agent reinforcement learning (MARL), the environments are highly stochastic due to the partial observability of each agent and the continuously changing policies of the other agents. One of popular research directions is to enhance the training procedure of fully cooperative and decentralized agents. In the past few years, a number of MARL researchers turned their attention to centralized training with decentralized execution (CTDE). Among these CTDE approaches, value function factorization methods are especially promising in terms of their superior performances and data efficiency. Value function factorization methods introduce the assumption of individual-global-max (IGM) [1], which assumes that each agent's optimal actions result in the optimal joint actions of the entire group. Based on IGM, the total return of a group of agents can be factorized into separate utility functions for each agent.

agent, machine learning, value function factorization method, (10 more...)

#artificialintelligence

Sep-26-2021, 07:05:10 GMT

News Web Page

Add feedback

Industry:
- Leisure & Entertainment > Games (0.32)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning > Agents
    - Agent Societies (0.31)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found