Seeing the Unseen: Errors and Bias in Visual Datasets

Nov-3-2022–arXiv.org Artificial Intelligence

Introduction From face recognition in smartphones to automatic routing on self-driving cars, machine vision algorithms lie in the core of these features. These systems solve image based tasks by identifying and understanding objects, subsequently making decisions from these information. A large set of images where the featured objects were labelled, known as datasets, are commonly used to develop and enhance machine vision algorithms (Cox 2016). However, errors in datasets are usually induced or even magnified in algorithms, at times resulting in issues such as recognising black people as gorillas and misrepresenting ethnicities in search results (Nieva 2015; Prabhu and Birhane 2020). This essay tracks the errors in datasets and their impacts, revealing that a flawed dataset could be a result of limited categories, incomprehensive sourcing and poor classification.

artificial intelligence, dataset, machine learning, (16 more...)

arXiv.org Artificial Intelligence

Nov-3-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York > New York County
    - New York City (0.04)
  - Florida > Miami-Dade County
    - Miami (0.04)
- Asia
  - India (0.04)
  - South Korea > Seoul
    - Seoul (0.04)

Genre:
- Research Report (0.50)

Industry:
- Information Technology (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Machine Learning (1.00)
  - Robots > Autonomous Vehicles (0.54)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found