Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods