Iterative Reachability Estimation for Safe Reinforcement Learning
–Neural Information Processing Systems
We theoretically establish that our algorithms almost surely converge to locally optimal policies of our safe optimization framework.
Neural Information Processing Systems
Oct-9-2025, 09:21:08 GMT