Towards Safe Reinforcement Learning with a Safety Editor Policy
Haonan Yu, Wei Xu, and Haichao Zhang
Horizon Robotics, Cupertino, CA 95014
{haonan.yu,wei.xu,haichao.zhang}@horizon.ai
Neural Information Processing Systems
Assuming no prior knowledge or pre-training of the environment safety model given a task, an agent has to learn, via exploration, which states and actions are safe.
Oct-2-2025, 09:59:15 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- North America > United States
- California > Santa Clara County > Cupertino (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Leisure & Entertainment > Games (0.67)
- Technology: