OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
Neural Information Processing Systems
Offline safe reinforcement learning (RL) aims to train a policy that satisfies constraints using a pre-collected dataset.
Oct-10-2025, 09:22:28 GMT
- Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Information Technology (0.46)
- Technology: