SPOT: Scalable Policy Optimization with Trees for Markov Decision Processes