Monte-Carlo Planning in Large POMDPs

Apr-6-2023, 13:41:43 GMT–Neural Information Processing Systems

This paper introduces a Monte-Carlo algorithm for online planning in large POMDPs. The algorithm combines a Monte-Carlo update of the agent's belief state with a Monte-Carlo tree search from the current belief state. The new algorithm, POMCP, has two important properties. First, Monte-Carlo sampling is used to break the curse of dimensionality both during belief state updates and during planning. Second, only a black box simulator of the POMDP is required, rather than explicit probability distributions.

algorithm, monte-carlo planning, pomdp, (3 more...)

Neural Information Processing Systems

Apr-6-2023, 13:41:43 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning
    - Belief Revision (1.00)
    - Planning & Scheduling (0.98)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (1.00)