On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient