Continuous-time reinforcement learning: ellipticity enables model-free value function approximation
