Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits
