LabelAId: Just-in-time AI Interventions for Improving Human Labeling Quality and Domain Knowledge in Crowdsourcing Systems
Li, Chu, Zhang, Zhihan, Saugstad, Michael, Safranchik, Esteban, Kulkarni, Minchu, Huang, Xiaoyu, Patel, Shwetak, Iyer, Vikram, Althoff, Tim, Froehlich, Jon E.
–arXiv.org Artificial Intelligence
Crowdsourcing platforms have transformed distributed problem-solving, yet quality control remains a persistent challenge. Traditional quality control measures, such as prescreening workers and refining instructions, often focus solely on optimizing economic output. This paper explores just-in-time AI interventions to enhance both labeling quality and domain-specific knowledge among crowdworkers. We introduce LabelAId, an advanced inference model combining Programmatic Weak Supervision (PWS) with FT-Transformers to infer label correctness based on user behavior and domain knowledge. Our technical evaluation shows that our LabelAId pipeline consistently outperforms state-of-the-art ML baselines, improving mistake inference accuracy by 36.7% with 50 downstream samples. We then implemented LabelAId into Project Sidewalk, an open-source crowdsourcing platform for urban accessibility. A between-subjects study with 34 participants demonstrates that LabelAId significantly enhances label precision without compromising efficiency while also increasing labeler confidence. We discuss LabelAId's success factors, limitations, and its generalizability to other crowdsourced science domains.
arXiv.org Artificial Intelligence
Mar-14-2024
- Country:
- Asia
- China
- Jiangsu Province > Nanjing (0.04)
- Zhejiang Province > Hangzhou (0.04)
- Malaysia (0.04)
- Middle East > Jordan (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Taiwan (0.04)
- China
- Europe
- Italy
- Netherlands
- North Holland > Amsterdam (0.04)
- South Holland > Dordrecht (0.04)
- Switzerland (0.04)
- United Kingdom
- North America
- Mexico (0.04)
- United States
- New York > New York County
- New York City (0.14)
- California
- Alameda County > Berkeley (0.04)
- Los Angeles County > Los Angeles (0.04)
- San Francisco County > San Francisco (0.14)
- Santa Barbara County > Santa Barbara (0.04)
- Santa Clara County > San Jose (0.04)
- District of Columbia > Washington (0.04)
- Washington > King County
- Seattle (0.14)
- Georgia > Fulton County
- Atlanta (0.14)
- Texas > Bexar County
- San Antonio (0.04)
- Illinois > Cook County
- Chicago (0.05)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Oregon
- Multnomah County > Portland (0.14)
- Yamhill County > Newberg (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- New York > New York County
- Oceania > New Zealand (0.04)
- South America
- Brazil > Rio de Janeiro
- Rio de Janeiro (0.04)
- Ecuador (0.04)
- Brazil > Rio de Janeiro
- Asia
- Genre:
- Questionnaire & Opinion Survey (1.00)
- Research Report
- Experimental Study > Negative Result (0.67)
- New Finding (1.00)
- Industry:
- Education
- Government (0.67)
- Health & Medicine (1.00)
- Technology: