Scaling Up Optimal Heuristic Search in Dec-POMDPs via Incremental Expansion

Spaan, Matthijs T. J. (Institute for Systems and Robotics, Instituto Superior Técnico) | Oliehoek, Frans A. (Massachusetts Institute of Technology) | Amato, Christopher (Aptima, Inc)

Jul-19-2011–AAAI Conferences

Planning under uncertainty for multiagent systems can be formalized as a decentralized partially observable Markov decision process. We advance the state of the art for optimal solution of this model, building on the Multiagent A* heuristic search method. A key insight is that we can avoid the full expansion of a search node that generates a number of children that is doubly exponential in the node's depth. Instead, we incrementally expand the children only when a next child might have the highest heuristic value. We target a subsequent bottleneck by introducing a more memory-efficient representation for our heuristic functions. Proof is given that the resulting algorithm is correct and experiments demonstrate a significant speedup over the state of the art, allowing for optimal solutions over longer horizons for many benchmark problems.

expansion, representation, zilberstein, (17 more...)

AAAI Conferences

Jul-19-2011

Conferences PDF

Add feedback

Country:
- North America > United States
  - Massachusetts > Middlesex County
    - Cambridge (0.14)
    - Woburn (0.04)
- Europe > Portugal
  - Lisbon > Lisbon (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning
    - Search (1.00)
    - Agents (1.00)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found