Iterative Reachability Estimation for Safe Reinforcement Learning

Neural Information Processing Systems 

We theoretically establish that our algorithms almost surely converge to locally optimal policies of our safe optimization framework.
