Solving POMDPs by Searching the Space of Finite Policies

Meuleau, Nicolas, Kim, Kee-Eung, Kaelbling, Leslie Pack, Cassandra, Anthony R.

Jan-23-2013–arXiv.org Artificial Intelligence

Solving partially observable Markov decision processes (POMDPs) is highly intractable in general, at least in part because the optimal policy may be infinitely large. In this paper, we explore the problem of finding the optimal policy from a restricted set of policies, represented as finite state automata of a given size. This problem is also intractable, but we show that the complexity can be greatly reduced when the POMDP and/or policy are further constrained. We demonstrate good empirical results with a branch-and-bound method for finding globally optimal deterministic policies, and a gradient-ascent method for finding locally optimal stochastic policies.
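To make the idea of a policy "represented as a finite state automaton of a given size" concrete, here is a minimal Python sketch. It is not the paper's implementation: it uses plain exhaustive enumeration as a stand-in for the branch-and-bound search, and it covers only deterministic controllers. The function names, the array layouts `T[s][a][s2]`, `O[s2][o]`, `R[s][a]`, the fixed start node 0, and the two-state toy problem are all assumptions made for illustration.

```python
from itertools import product


def evaluate_controller(T, O, R, gamma, actions, successors, tol=1e-10):
    """Iteratively evaluate a deterministic finite-state controller.

    T[s][a][s2] is a transition probability, O[s2][o] the probability of
    observing o on arrival in s2, and R[s][a] the immediate reward
    (layouts assumed for this sketch). actions[n] is the action taken in
    controller node n; successors[n][o] is the node entered after
    observing o in node n. Returns V[n][s], the discounted value of
    starting in node n while the hidden state is s.
    """
    n_nodes, n_states, n_obs = len(actions), len(T), len(O[0])
    V = [[0.0] * n_states for _ in range(n_nodes)]
    while True:
        delta = 0.0
        new_V = [[0.0] * n_states for _ in range(n_nodes)]
        for n in range(n_nodes):
            a = actions[n]
            for s in range(n_states):
                future = sum(
                    T[s][a][s2] * O[s2][o] * V[successors[n][o]][s2]
                    for s2 in range(n_states)
                    for o in range(n_obs)
                )
                new_V[n][s] = R[s][a] + gamma * future
                delta = max(delta, abs(new_V[n][s] - V[n][s]))
        V = new_V
        if delta < tol:
            return V


def best_deterministic_controller(T, O, R, gamma, b0, n_nodes):
    """Exhaustively search all deterministic controllers with n_nodes nodes.

    Node 0 is assumed to be the start node and b0 the initial belief.
    This brute-force loop only stands in for a smarter search.
    """
    n_actions, n_obs = len(R[0]), len(O[0])
    best = (float("-inf"), None, None)
    for actions in product(range(n_actions), repeat=n_nodes):
        for flat in product(range(n_nodes), repeat=n_nodes * n_obs):
            successors = [
                list(flat[n * n_obs:(n + 1) * n_obs]) for n in range(n_nodes)
            ]
            V = evaluate_controller(T, O, R, gamma, actions, successors)
            value = sum(b0[s] * V[0][s] for s in range(len(b0)))
            if value > best[0]:
                best = (value, actions, successors)
    return best


# Hypothetical toy problem: the hidden state flips every step, the single
# observation is uninformative, and reward is 1 when the action matches
# the current state.
T = [[[0.0, 1.0], [0.0, 1.0]], [[1.0, 0.0], [1.0, 0.0]]]
O = [[1.0], [1.0]]
R = [[1.0, 0.0], [0.0, 1.0]]
gamma, b0 = 0.9, [1.0, 0.0]

v1, _, _ = best_deterministic_controller(T, O, R, gamma, b0, n_nodes=1)
v2, acts, succ = best_deterministic_controller(T, O, R, gamma, b0, n_nodes=2)
print(round(v1, 3), round(v2, 3))  # prints: 5.263 10.0
```

In this toy problem a one-node controller can only repeat a single action and is rewarded every other step, while a two-node controller can alternate actions and is rewarded every step. The number of candidate controllers grows exponentially with the number of nodes and observations, which is why the exhaustive loop is only practical for very small sizes.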

artificial intelligence, machine learning, policy graph

Country:
- North America > United States > Massachusetts (0.47)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
