Conservative and Greedy Approaches to Classification-Based Policy Iteration