Investigating the Capabilities and Limitations of Machine Learning for Identifying Bias in English Language Data with Information and Heritage Professionals
Havens, Lucy, Bach, Benjamin, Terras, Melissa, Alex, Beatrice
–arXiv.org Artificial Intelligence
Despite numerous efforts to mitigate their biases, ML systems continue to harm already-marginalized people. While predominant ML approaches assume bias can be removed and fair models can be created, we show that these are not always possible, nor desirable, goals. We reframe the problem of ML bias by creating models to identify biased language, drawing attention to a dataset's biases rather than trying to remove them. Then, through a workshop, we evaluated the models for a specific use case: workflows of information and heritage professionals. Our findings demonstrate the limitations of ML for identifying bias due to its contextual nature, the way in which approaches to mitigating it can simultaneously privilege and oppress different communities, and its inevitability. We demonstrate the need to expand ML approaches to bias and fairness, providing a mixed-methods approach to investigating the feasibility of removing bias or achieving fairness in a given ML use case.
arXiv.org Artificial Intelligence
Apr-1-2025
- Country:
- Africa (0.04)
- Asia
- India (0.04)
- Japan > Honshū
- Kantō > Kanagawa Prefecture > Yokohama (0.05)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Europe
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- France > Auvergne-Rhône-Alpes
- Germany > Hamburg (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy > Tuscany
- Florence (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom
- England
- Greater London > London (0.04)
- Oxfordshire > Oxford (0.14)
- Scotland
- City of Aberdeen > Aberdeen (0.04)
- City of Edinburgh > Edinburgh (0.04)
- City of Glasgow > Glasgow (0.04)
- England
- Croatia > Dubrovnik-Neretva County
- North America > United States
- New York > New York County
- New York City (0.05)
- Washington > King County
- Seattle (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- California > Los Angeles County
- Los Angeles (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- New York > New York County
- South America > Brazil
- Rio de Janeiro > Rio de Janeiro (0.04)
- Genre:
- Research Report > New Finding (0.86)
- Industry:
- Education > Curriculum
- Subject-Specific Education (0.45)
- Government (0.93)
- Health & Medicine (1.00)
- Law > Civil Rights & Constitutional Law (0.46)
- Education > Curriculum
- Technology: