Correcting Sample Selection Bias by Unlabeled Data

Huang, Jiayuan, Gretton, Arthur, Borgwardt, Karsten, Schölkopf, Bernhard, Smola, Alex J.

Dec-31-2007–Neural Information Processing Systems

We consider the scenario where training and test data are drawn from different distributions, commonly referred to as sample selection bias. Most algorithms for this setting try to first recover sampling distributions and then make appropriate corrections based on the distribution estimate. We present a nonparametric method which directly produces resampling weights without distribution estimation. Our method works by matching distributions between training and testing sets in feature space. Experimental results demonstrate that our method works well in practice.

dataset, sample selection bia, selection bia, (15 more...)

Neural Information Processing Systems

Dec-31-2007

Conferences PDF

Add feedback

Country:
- Asia > India (0.04)
- Oceania > Australia
  - Australian Capital Territory > Canberra (0.04)
- North America
  - United States
    - Georgia > Fulton County
      - Atlanta (0.04)
    - California > Monterey County
      - Pacific Grove (0.04)
  - Canada > Ontario
    - Waterloo Region > Waterloo (0.04)
- Europe > Germany
  - Bavaria > Upper Bavaria
    - Munich (0.04)
  - Baden-Württemberg > Tübingen Region
    - Tübingen (0.14)

Genre:
- Research Report > New Finding (0.66)

Industry:
- Health & Medicine
  - Therapeutic Area > Oncology (1.00)
  - Pharmaceuticals & Biotechnology (0.96)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Duplicate Docs Excel Report

Title
Correcting Sample Selection Bias by Unlabeled Data
Correcting Sample Selection Bias by Unlabeled Data

Similar Docs Excel Report more

Title	Similarity	Source
None found