Robustness in Markov Decision Problems with Uncertain Transition Matrices

Dec-31-2004–Neural Information Processing Systems

Optimal solutions to Markov Decision Problems (MDPs) are very sensitive with respect to the state transition probabilities. In many practical problems, the estimation of those probabilities is far from accurate. Hence, estimation errors are limiting factors in applying MDPs to realworld problems. We propose an algorithm for solving finite-state and finite-action MDPs, where the solution is guaranteed to be robust with respect to estimation errors on the state transition probabilities. Our algorithm involves a statistically accurate yet numerically efficient representation of uncertainty, via Kullback-Leibler divergence bounds. The worst-case complexity of the robust algorithm is the same as the original Bellman recursion. Hence, robustness can be added at practically no extra computing cost.

algorithm, cost function, transition matrix, (12 more...)

Neural Information Processing Systems

Dec-31-2004

Conferences PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - California > Alameda County
    - Berkeley (0.14)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (0.69)
  - Machine Learning > Learning Graphical Models (0.48)

Duplicate Docs Excel Report

Title
Robustness in Markov Decision Problems with Uncertain Transition Matrices
Robustness in Markov Decision Problems with Uncertain Transition Matrices

Similar Docs Excel Report more

Title	Similarity	Source
None found