Solving Transition Independent Decentralized Markov Decision Processes

Becker, R., Zilberstein, S., Lesser, V., Goldman, C. V.

Dec-1-2004–Journal of Artificial Intelligence Research

Formal treatment of collaborative multi-agent systems has been lagging behind the rapid progress in sequential decision making by individual agents. Recent work in the area of decentralized Markov Decision Processes (MDPs) has contributed to closing this gap, but the computational complexity of these models remains a serious obstacle. To overcome this complexity barrier, we identify a specific class of decentralized MDPs in which the agents' transitions are independent. The class consists of independent collaborating agents that are tied together through a structured global reward function that depends on all of their histories of states and actions. We present a novel algorithm for solving this class of problems and examine its properties, both as an optimal algorithm and as an anytime algorithm. To the best of our knowledge, this is the first algorithm to optimally solve a non-trivial subclass of decentralized MDPs. It lays the foundation for further work in this area on both exact and approximate algorithms.

agent, algorithm, optimal coverage, (14 more...)

Journal of Artificial Intelligence Research

Dec-1-2004

Journals PDF

Add feedback

Country:
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America
  - United States
    - Washington > King County
      - Seattle (0.04)
    - New York
      - Richmond County > New York City (0.04)
      - Queens County > New York City (0.04)
      - New York County > New York City (0.04)
      - Kings County > New York City (0.04)
      - Bronx County > New York City (0.04)
    - Michigan > Washtenaw County
      - Ann Arbor (0.14)
    - Massachusetts > Hampshire County
      - Amherst (0.14)
    - California > San Francisco County
      - San Francisco (0.14)
  - Canada
    - Quebec > Montreal (0.04)
    - Alberta > Census Division No. 11
      - Edmonton Metropolitan Region > Edmonton (0.04)
- Europe > Italy
  - Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents
    - Agent Societies (0.88)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (0.85)