Control Variates for Slate Off-Policy Evaluation: Supplementary Text

Nikos Vlassis (Netflix), Ashok Chandrashekar (WarnerMedia), Fernando Amat Gil (Netflix), Nathan Kallus (Cornell University and Netflix)

Neural Information Processing Systems 

In this appendix, we provide additional details about the MSLR-WEB30K data and the experimental protocol we followed, prove Lemma 11 of the main paper, and show additional results on the MSLR-WEB30K and simulated data.
