Finding Safe Zones of Markov Decision Processes Policies

Neural Information Processing Systems 

One notable exception to that is Safe RL which addresses the concept of safety. Traditional Safe RL focuses on finding the best policy that meets safety requirements, typically by either adjusting the objective to include the safety requirements and then optimizing for it, or incorporating additional safety constraints to the exploration.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found