Combining Crowd and Expert Labels Using Decision Theoretic Active Learning
Nguyen, An Thanh (University of Texas at Austin) | Wallace, Byron C. (University of Texas at Austin) | Lease, Matthew (University of Texas at Austin)
We consider a finite-pool data categorization scenario which requires exhaustively classifying a given set of examples with a limited budget. We adopt a hybrid human-machine approach which blends automatic machine learning with human labeling across a tiered workforce composed of domain experts and crowd workers. To effectively achieve high-accuracy labels over the instances in the pool at minimal cost, we develop a novel approach based on decision-theoretic active learning. On the important task of biomedical citation screening for systematic reviews, results on real data show that our method achieves consistent improvements over baseline strategies. To foster further research by others, we have made our data available online.
Nov-1-2015
- Country:
- North America > United States
- New York (0.04)
- Texas > Travis County
- Austin (0.04)
- Asia > Middle East
- Jordan (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Health & Medicine (1.00)
- Technology: