Provably Efficient Model-Free Constrained RL with Linear Function Approximation
–Neural Information Processing Systems
We study the constrained reinforcement learning problem, in which an agent aims to maximize the expected cumulative reward subject to a constraint on the expected total value of a utility function. In contrast to existing model-based approaches or model-free methods accompanied with a'simulator', we aim to develop the first model-free, simulator-free algorithm that achieves a sublinear regret and a sublinear constraint violation even in large-scale systems.
Neural Information Processing Systems
Nov-14-2025, 07:52:16 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- North America > United States
- Michigan > Wayne County
- Detroit (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- Michigan > Wayne County
- South America > Chile
- Asia > Middle East
- Genre:
- Research Report (0.46)
- Technology: