Solving Stabilize-Avoid Optimal Control via Epigraph Form and Deep Reinforcement Learning

Open in new window