Safe and Efficient: A Primal-Dual Method for Offline Convex CMDPs under Partial Data Coverage
–Neural Information Processing Systems
Offline safe reinforcement learning (RL) aims to find an optimal policy using a pre-collected dataset when data collection is impractical or risky.
Neural Information Processing Systems
Feb-11-2026, 13:05:04 GMT
- Country:
- North America > United States
- Washington (0.04)
- Europe > United Kingdom
- England > Oxfordshire > Oxford (0.04)
- North America > United States
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.68)
- Research Report
- Industry:
- Information Technology (0.46)
- Technology: