Review for NeurIPS paper: Towards Safe Policy Improvement for Non-Stationary MDPs