Constrained Update Projection Approach to Safe Policy Optimization Long Y ang
–Neural Information Processing Systems
Safe reinforcement learning (RL) studies problems where an intelligent agent has to not only maximize reward but also avoid exploring unsafe areas.
Neural Information Processing Systems
Nov-20-2025, 08:53:45 GMT
- Country:
- Asia > China
- Guangdong Province > Shenzhen (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- South America > Chile
- Asia > China
- Genre:
- Research Report (0.45)
- Workflow (0.46)
- Technology: