Improving Consensus Accuracy via Z-Score and Weighted Voting
Jung, Hyun Joon (University of Texas at Austin) | Lease, Matthew (University of Texas at Austin)
Using supervised and unsupervised features individually or together, we (a) detect and filter out noisy workers via Z-score, and (b) weight worker votes for consensus labeling. We evaluate on noisy labels from Amazon Mechanical Turk in which workers judge Web search relevance of query/document pairs. In comparison to a majority vote baseline, results show a 6% error reduction (48.83% to 51.91%) for graded accuracy and 5% error reduction (64.88% to 68.33%) for binary accuracy.
Aug-8-2011
- Country:
- North America > United States > Texas > Travis County > Austin (0.05)
- Genre:
- Research Report > New Finding (0.36)
- Technology: