Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach