A survey of Identification and mitigation of Machine Learning algorithmic biases in Image Analysis
Risser, Laurent, Picard, Agustin, Hervier, Lucas, Loubes, Jean-Michel
–arXiv.org Artificial Intelligence
The ubiquity of Machine Learning (ML) models, and more specifically deep neural network (NN) models, in all sorts of applications has become undeniable in recent years. From classifying images [1, 2, 3], detecting objects [4, 1] and performing semantic segmentation [5, 4] to translating from one human language to another [6] and doing sentiment analysis [7], the advances in different subfields of ML can be attributed mostly to the explosion of computing power and their ability to speed up the training process of artificial NNs. Most famously, AlexNet [8] allowed for an impressive jump in performance in the challenging ILSVRC2012 image classification dataset [1], also known as ImageNet, permanently cementing deep convolutional NN (CNN) architectures in the field of computer vision. Since then, architectures have gotten more refined [9, 10], training procedures have gotten increasingly more complex [11], and their performance and robustness have greatly improved as a consequence. Namely, the success of these deep CNN models is related to their ability to treat high-dimensional and complex data such as images or natural language. The impressive performance of NNs for machine learning tasks can be explained by the ability of their flexible architecture to capture meaningful information on various kinds of complex data and the fact that they are potentially composed of millions of parameters. However, this poses a major challenge: deciphering the reasoning behind the model's predictions. For instance, typical NN architectures for classification or regression problems incrementally transform the representation of the input data in the so-called latent space (or feature space) and then use this transformed representation to make their predictions, as summarized in Figure 1. Each step of this incremental data processing pipeline (or feature extraction chain) is carried out by a so-called layer, which is mathematically a non-linear function (blue rectangle in Figure 1).
arXiv.org Artificial Intelligence
Oct-10-2022
- Country:
- North America
- Europe > France
- Occitanie > Haute-Garonne > Toulouse (0.05)
- Genre:
- Overview (1.00)
- Instructional Material > Course Syllabus & Notes (0.34)
- Industry:
- Law (1.00)
- Government (0.92)
- Information Technology (0.66)
- Health & Medicine
- Diagnostic Medicine > Imaging (0.93)
- Therapeutic Area (0.68)
- Technology: