Constrained Cross-Entropy Method for Safe Reinforcement Learning

Feb-14-2020, 19:43:25 GMT–Neural Information Processing Systems

We study a safe reinforcement learning problem in which the constraints are defined as the expected cost over finite-length trajectories. We propose a constrained cross-entropy-based method to solve this problem. The method explicitly tracks its performance with respect to constraint satisfaction and is thus well suited for safety-critical applications. We show that the asymptotic behavior of the proposed algorithm can be described, almost surely, by that of an ordinary differential equation. We then give sufficient conditions on the properties of this differential equation that guarantee the convergence of the proposed algorithm.
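As a rough illustration of the idea in the abstract, the following is a minimal sketch of a cross-entropy search that tracks constraint satisfaction while it optimizes. It is not the paper's algorithm as published: the Gaussian sampling distribution, the elite-selection rule (rank by cost while too few samples are feasible, otherwise rank feasible samples by objective), the function name `constrained_cem`, and all parameter names and values are assumptions made for illustration. The deterministic toy `objective` and `cost` functions stand in for the expected reward and expected cost over finite-length trajectories.

```python
import numpy as np

def constrained_cem(objective, cost, cost_limit, dim,
                    n_samples=200, elite_frac=0.1, n_iters=50, seed=0):
    """Toy constrained cross-entropy search (illustrative assumption, not the paper's code)."""
    rng = np.random.default_rng(seed)
    mean = np.zeros(dim)            # mean of the Gaussian sampling distribution
    std = np.full(dim, 2.0)         # per-dimension standard deviation
    n_elite = max(1, int(n_samples * elite_frac))

    for _ in range(n_iters):
        samples = rng.normal(mean, std, size=(n_samples, dim))
        J = np.array([objective(s) for s in samples])   # value to maximize
        H = np.array([cost(s) for s in samples])        # constrained cost

        feasible = np.flatnonzero(H <= cost_limit)
        if feasible.size < n_elite:
            # Too few feasible samples: move toward feasibility first
            # by keeping the samples with the lowest cost.
            elite_idx = np.argsort(H)[:n_elite]
        else:
            # Enough feasible samples: keep the feasible ones with the
            # highest objective value.
            elite_idx = feasible[np.argsort(-J[feasible])[:n_elite]]

        elites = samples[elite_idx]
        mean = elites.mean(axis=0)
        std = elites.std(axis=0) + 1e-6   # small floor avoids a zero std

    return mean

# Hypothetical toy problem: get as close as possible to (2, 2)
# while staying inside the unit disc.
target = np.array([2.0, 2.0])
objective = lambda x: -np.sum((x - target) ** 2)
cost = lambda x: np.sum(x ** 2)

theta = constrained_cem(objective, cost, cost_limit=1.0, dim=2)
print(theta, cost(theta))
```

In this toy setting the constrained optimum lies on the boundary of the disc at roughly (0.707, 0.707), so the returned mean should settle near that point while its cost stays at or below the limit. Selecting elites by cost while feasibility is scarce is what lets the search report, at every iteration, how far it is from satisfying the constraint.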

algorithm, constrained cross-entropy method, safe reinforcement learning

Industry:
- Education > Focused Education > Special Education (0.31)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)