Monte Carlo Value Iteration with Macro-Actions

Dec-31-2011–Neural Information Processing Systems

POMDP planning faces two major computational challenges: large state spaces and long planning horizons. The recently introduced Monte Carlo Value Iteration (MCVI) can tackle POMDPs with very large discrete state spaces or continuous state spaces, but its performance degrades when faced with long planning horizons. This paper presents Macro-MCVI, which extends MCVI by exploiting macro-actions for temporal abstraction. We provide sufficient conditions for Macro-MCVI to inherit the good theoretical properties of MCVI. Macro-MCVI does not require explicit construction of probabilistic models for macro-actions and is thus easy to apply in practice. Experiments show that Macro-MCVI substantially improves the performance of MCVI with suitable macro-actions.

artificial intelligence, machine learning, state space, (13 more...)

Neural Information Processing Systems

Dec-31-2011

Conferences PDF

Add feedback

Country:
- Asia > Singapore (0.14)

Genre:
- Research Report (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (0.94)

Duplicate Docs Excel Report

Title
Monte Carlo Value Iteration with Macro-Actions

Similar Docs Excel Report more

Title	Similarity	Source
None found