Nonapproximability Results for Partially Observable Markov Decision Processes
Goldsmith, J., Lusena, C., Mundhenk, M.
–arXiv.org Artificial Intelligence
Here \unlikely" means \unless some complexity classes collapse," where the collapses considered are P NP, P PSPACE, or P EXP. Until or unless these collapses are shown to hold, any control-policy designer must choose between such performance guarantees and ecient computation. In this work, we show that uncertainty breeds uncertainty: In a controlled stochastic system with uncertainty (as modeled by a partially observable Markov decision process, for example), plans can be obtained eciently or with quality guarantees, but rarely both. Planning over stochastic domains with uncertainty is hard (in some cases PSPACEhard or even undecidable, see Papadimitriou & Tsitsiklis, 1987; Madani, Hanks, & Condon, 1999). Given that it is hard to nd an optimal plan or policy, it is natural to try to nd one that is \good enough". In the best of all possible worlds, this would mean having an algorithm that is guaranteed to be fast and to produce a policy that is reasonably close to the optimal policy.
arXiv.org Artificial Intelligence
Jun-1-2011
- Country:
- Europe > Germany (0.04)
- North America > United States
- New York (0.04)
- Rhode Island > Providence County
- Providence (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Kentucky > Fayette County
- Lexington (0.04)
- California
- San Francisco County > San Francisco (0.14)
- San Mateo County > Menlo Park (0.04)
- Genre:
- Research Report (0.64)
- Technology: