Faster Policy Learning with Continuous-Time Gradients