Estimating Q(s,s') with Deep Deterministic Dynamics Gradients