On-line Policy Improvement using Monte-Carlo Search