Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation