Control Variates for Slate Off-Policy Evaluation

Neural Information Processing Systems 

Swaminathan et al. (2017) have proposed the pseudoinverse (PI) estimator under the assumption

Similar Docs  Excel Report  more

TitleSimilaritySource
None found