Safe Policy Improvement Approaches on Discrete Markov Decision Processes