SPO: Sequential Monte Carlo Policy Optimisation