Reinforcement Learning in a Safety-Embedded MDP with Trajectory Optimization

Open in new window