Provably Efficient Model-Free Constrained RL with Linear Function Approximation

Neural Information Processing Systems 

We study the constrained reinforcement learning problem, in which an agent aims to maximize the expected cumulative reward subject to a constraint on the expected total value of a utility function. In contrast to existing model-based approaches or model-free methods accompanied by a 'simulator', we aim to develop the first model-free, simulator-free algorithm that achieves a sublinear regret and a sublinear constraint violation even in large-scale systems.
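
As a reading aid, the problem and the two performance measures named above can be sketched in the way constrained Markov decision processes are commonly written. The notation here is assumed for illustration and does not appear in the abstract: $V_r^{\pi}$ and $V_g^{\pi}$ denote the expected cumulative reward and utility of a policy $\pi$, $b$ is the constraint threshold, $\pi^*$ is an optimal feasible policy, and $\pi_k$ is the policy used in episode $k$ of $K$.

```latex
% Constrained objective (assumed notation)
\max_{\pi} \; V_r^{\pi} \quad \text{subject to} \quad V_g^{\pi} \ge b

% Performance measures over K episodes (assumed notation)
\mathrm{Regret}(K) = \sum_{k=1}^{K} \bigl( V_r^{\pi^*} - V_r^{\pi_k} \bigr),
\qquad
\mathrm{Violation}(K) = \sum_{k=1}^{K} \bigl( b - V_g^{\pi_k} \bigr)
```

Under this reading, "sublinear" means that both quantities grow as $o(K)$, so the per-episode regret and the per-episode constraint violation vanish as $K$ increases.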
