Correcting Sample Selection Bias by Unlabeled Data
Huang, Jiayuan, Gretton, Arthur, Borgwardt, Karsten M., Schölkopf, Bernhard, Smola, Alex J.
–Neural Information Processing Systems
We consider the scenario where training and test data are drawn from different distributions, commonly referred to as sample selection bias. Most algorithms for this setting try to first recover sampling distributions and then make appropriate corrections based on the distribution estimate. We present a nonparametric method which directly produces resampling weights without distribution estimation. Our method works by matching distributions between training and testing sets in feature space. Experimental results demonstrate that our method works well in practice.
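As a rough illustration of what "matching distributions in feature space" can look like, the sketch below reweights training points so that their weighted kernel mean approaches the kernel mean of the test points. The abstract does not state the optimization problem, so everything here is an assumption made for illustration: the Gaussian kernel, the bandwidth `sigma`, the weight cap `upper`, the hypothetical function name `match_feature_means`, and the plain projected-gradient solver. The sketch also enforces only a box constraint on the weights and is not the authors' implementation.

```python
import math


def gaussian_kernel(x, y, sigma=0.5):
    """Gaussian (RBF) kernel on scalars; sigma is an assumed bandwidth."""
    return math.exp(-((x - y) ** 2) / (2.0 * sigma ** 2))


def match_feature_means(train, test, sigma=0.5, upper=10.0, steps=500):
    """Find weights beta in [0, upper] for the training points by minimizing
    0.5 * beta^T K beta - kappa^T beta, which matches the weighted training
    mean to the test mean in the kernel feature space (up to constants).
    Hypothetical helper, solved with projected gradient descent."""
    n, m = len(train), len(test)
    # K[i][j] = k(train_i, train_j)
    K = [[gaussian_kernel(a, b, sigma) for b in train] for a in train]
    # kappa[i] = (n / m) * sum_j k(train_i, test_j)
    kappa = [(n / m) * sum(gaussian_kernel(a, t, sigma) for t in test)
             for a in train]
    # The largest row sum bounds the top eigenvalue of K (entries are >= 0),
    # so 1 / (max row sum) is a safe step size.
    step = 1.0 / max(sum(row) for row in K)
    beta = [1.0] * n  # start from uniform weights (no correction)
    for _ in range(steps):
        grad = [sum(K[i][j] * beta[j] for j in range(n)) - kappa[i]
                for i in range(n)]
        # Gradient step followed by projection onto the box [0, upper].
        beta = [min(upper, max(0.0, b - step * g))
                for b, g in zip(beta, grad)]
    return beta


# Toy data (made up): half the training points lie far from the test points.
train_points = [0.0, 0.1, 0.2, 2.0, 2.1, 2.2]
test_points = [2.0, 2.05, 2.1, 2.15]
weights = match_feature_means(train_points, test_points)
for x, w in zip(train_points, weights):
    print(f"x = {x:.2f}  weight = {w:.3f}")
```

On this toy data the weights concentrate on the training points near the test sample, and no density is estimated along the way. A practical version would typically hand the same quadratic objective to a QP solver and add a constraint keeping the average weight close to one.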
Dec-31-2007
- Country:
- Europe > Germany
- Baden-Württemberg > Tübingen Region > Tübingen (0.14)
- North America > United States (0.29)
- Oceania > Australia (0.28)
- Genre:
- Research Report > New Finding (0.48)