XXXXX

Apr-24-2026, 17:52:21 GMT–Neural Information Processing Systems

In contrast to the advances in characterizing the sample complexity for solving Markov decision processes (MDPs), the optimal statistical complexity for solving constrained MDPs (CMDPs) remains unknown. We resolve this question by providing minimax upper and lower bounds on the sample complexity for learning near-optimal policies in a discounted CMDP with access to a generative model (simulator). In particular, we design a model-based algorithm that addresses two settings: (i) relaxed feasibility, where small constraint violations are allowed, and (ii) strict feasibility, where the output policy is required to satisfy the constraint.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Apr-24-2026, 17:52:21 GMT

Conferences PDF

Add feedback

Country:
- North America
  - Canada (0.46)
  - United States > California (0.28)

Genre:
- Research Report > New Finding (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning
    - Reinforcement Learning (0.95)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.35)

Duplicate Docs Excel Report

Title
XXXXX

Similar Docs Excel Report more

Title	Similarity	Source
None found