Long-term Off-Policy Evaluation and Learning