ProvablyEfficientModel-FreeConstrainedRLwith LinearFunctionApproximation
–Neural Information Processing Systems
We study the constrained reinforcement learning problem, in which an agent aims tomaximize the expected cumulativereward subject toaconstraint on the expected total value of a utility function. In contrast to existing model-based approaches or model-free methods accompanied with a'simulator', we aim to develop thefirst model-free, simulator-freealgorithm that achieves a sublinear regret and a sublinear constraint violation even inlarge-scale systems.
Neural Information Processing Systems
Feb-9-2026, 02:31:19 GMT
- Country:
- North America > United States
- Michigan > Wayne County
- Detroit (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- Michigan > Wayne County
- South America > Chile
- North America > United States
- Technology: