Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling

Open in new window