Safe and Efficient: A Primal-Dual Method for Offline Convex CMDPs under Partial Data Coverage

Neural Information Processing Systems 

Offline safe reinforcement learning (RL) aims to find an optimal policy using a pre-collected dataset when data collection is impractical or risky.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found