Continuous-time reinforcement learning: ellipticity enables model-free value function approximation