Multiagent Planning with Factored MDPs

Apr-6-2023, 16:43:25 GMT–Neural Information Processing Systems

We present a principled and efficient planning algorithm for cooperative multia- gent dynamic systems. A striking feature of our method is that the coordination and communication between the agents is not imposed, but derived directly from the system dynamics and function approximation architecture. We view the en- tire multiagent system as a single, large Markov decision process (MDP), which we assume can be represented in a factored way using a dynamic Bayesian net- work (DBN). The action space of the resulting MDP is the joint action space of the entire set of agents. Our approach is based on the use of factored linear value functions as an approximation to the joint value function.

action space, multiagent planning, value function, (2 more...)

Neural Information Processing Systems

Apr-6-2023, 16:43:25 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning > Learning Graphical Models
    - Directed Networks > Bayesian Learning (0.60)