Saturated Path-Constrained MDP: Planning under Uncertainty and Deterministic Model-Checking Constraints
Sprauel, Jonathan (ONERA – The French Aerospace Lab) | Kolobov, Andrey (Microsoft Research) | Teichteil-Königsbuch, Florent (ONERA – The French Aerospace Lab)
In many probabilistic planning scenarios, a system’s behavior needs to not only maximize the expected utility but also obey certain restrictions. This paper presents Saturated Path-Constrained Markov Decision Processes (SPC MDPs), a new MDP type for planning under uncertainty with deterministic model-checking constraints, e.g., "state s must be visited befores s'", "the system must end up in s", or "the system must never enter s". We present a mathematical analysis of SPCMDPs, showing that although SPC MDPs generally have no optimal policies, every instance of this class has an epsilon-optimal randomized policy for any > 0. We propose a dynamic programming-based algorithm for finding such policies, and empirically demonstrate this algorithm to be orders of magnitude faster than its next-best alternative.
Jul-14-2014
- Country:
- North America > United States
- Washington > King County
- Redmond (0.04)
- New York > New York County
- New York City (0.04)
- Washington > King County
- Europe > France
- Occitanie > Haute-Garonne > Toulouse (0.04)
- North America > United States