Achieving O (1 /ε) Sample Complexity for Constrained Markov Decision Process

Neural Information Processing Systems 

We consider the reinforcement learning problem for the constrained Markov decision process (CMDP), which plays a central role in satisfying safety or resource constraints in sequential learning and decision-making.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found