Finding Safe Zones of Markov Decision Processes Policies

Apr-30-2026, 01:18:54 GMT–Neural Information Processing Systems

Given a policy of a Markov Decision Process, we define a SAFEZONE as a subset of states, such that most of the policy's trajectories are confined to this subset. The quality of a SAFEZONE is parameterized by the number of states and the escape probability, i.e., the probability that a random trajectory will leave the subset. SAFEZONES are especially interesting when they have a small number of states and low escape probability. We study the complexity of finding optimal SAFEZONES, and show that in general, the problem is computationally hard. Our main result is a bi-criteria approximation learning algorithm with a factor of almost 2 approximation for both the escape probability and SAFEZONE size, using a polynomial size sample complexity.

artificial intelligence, machine learning, probability, (17 more...)

Neural Information Processing Systems

Apr-30-2026, 01:18:54 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.67)

Industry:
- Transportation (0.93)
- Government > Regional Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (0.93)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (1.00)

Duplicate Docs Excel Report

Title
dfaa29ed28dfa175bcc5e2a54aa199f8-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found