Constrained Update Projection Approach to Safe Policy Optimization
Neural Information Processing Systems
Safe reinforcement learning (RL) studies problems where an intelligent agent has to not only maximize reward but also avoid exploring unsafe areas.
Feb-8-2026, 10:36:10 GMT