TheLabelComplexityofActiveLearningfrom ObservationalData

Neural Information Processing Systems 

In this problem, the learner is given observational data - a set of examples selected according to some policy along with their labels - as well as access to the policy that selects the examples, and the goal is to construct a classifier with high performance on an entire population, notjusttheobservational data distribution.