Safe Evaluation For Offline Learning: Are We Ready To Deploy?