Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits

Muhammad Faaiz Taufiq

Neural Information Processing Systems 

... Marginalized Inverse Propensity Score (MIPS) estimator, proving that MR achieves lower variance among a generalized family of MIPS estimators.
