Model-Based Policy Search Using Monte Carlo Gradient Estimation with Real Systems Application