Constrained Update Projection Approach to Safe Policy Optimization

Open in new window