Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes