Off-PolicyEvaluationandLearning forExternalValidityunderaCovariateShift

Neural Information Processing Systems 

Although the standard OPE and OPL methods assume the same distribution of covariate between the historical and evaluation data, a covariate shift often exists in real-world applications, i.e., the distribution of the covariate of the historical data is different from that of the evaluationdata.