A survey on bias in machine learning research

Agnieszka Mikołajczyk-Bareła, Michał Grochowski

Aug-22-2023–arXiv.org Artificial Intelligence

Current research on bias in machine learning often focuses on fairness while overlooking the roots or causes of bias. However, bias was originally defined as a "systematic error," often caused by humans at different stages of the research process. This article aims to bridge the gap with past literature on bias in research by providing a taxonomy of potential sources of bias and errors in data and models. The paper focuses on bias in machine learning (ML) pipelines: the survey analyses over forty potential sources of bias in the pipeline, providing clear examples for each. Understanding the sources and consequences of bias in machine learning makes it possible to develop better methods for detecting and mitigating it, leading to fairer, more transparent, and more accurate ML models.

artificial intelligence, deep learning, machine learning

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America
  - Greenland (0.04)
  - Dominican Republic (0.04)
  - United States
    - Virginia (0.04)
    - Washington > King County
      - Seattle (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
- Europe
  - United Kingdom > England (0.04)
  - Ireland (0.04)
  - Spain > Galicia
    - A Coruña Province > Santiago de Compostela (0.04)
  - Poland > Pomerania Province
    - Gdańsk (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
- Asia
  - India (0.04)
  - China > Hong Kong (0.04)

Genre:
- Research Report > Experimental Study (1.00)
- Overview (1.00)

Industry:
- Law (0.67)
- Health & Medicine
  - Diagnostic Medicine > Imaging (1.00)
  - Consumer Health (0.93)
  - Epidemiology (0.68)
  - Public Health (0.67)
  - Health Care Providers & Services (0.67)
  - Nuclear Medicine (0.67)
  - Therapeutic Area
    - Oncology (1.00)
    - Dermatology (0.93)
    - Cardiology/Vascular Diseases (0.68)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
