Approximate Dec-POMDP Solving Using Multi-Agent A*

Koops, Wietze, Junges, Sebastian, Jansen, Nils

May-9-2024–arXiv.org Artificial Intelligence

We present an A*-based algorithm to compute policies for finite-horizon Dec-POMDPs. Our goal is to sacrifice optimality in favor of scalability for larger horizons. The main ingredients of our approach are (1) using clustered sliding window memory, (2) pruning the A* search tree, and (3) using novel A* heuristics. Our experiments show competitive performance to the state-of-the-art. Moreover, for multiple benchmarks, we achieve superior performance. In addition, we provide an A* algorithm that finds upper bounds for the optimum, tailored towards problems with long horizons. The main ingredient is a new heuristic that periodically reveals the state, thereby limiting the number of reachable beliefs. Our experiments demonstrate the efficacy and scalability of the approach.

algorithm, dec-pomdp, window memory, (16 more...)

arXiv.org Artificial Intelligence

May-9-2024

arXiv.org PDF

Add feedback

Country:
- Europe
  - Slovenia (0.04)
  - Germany (0.04)
  - Netherlands > Gelderland
    - Nijmegen (0.04)

Genre:
- Research Report (0.49)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning
    - Search (1.00)
    - Agents (1.00)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found