A Study on Bias Detection and Classification in Natural Language Processing

Evans, Ana Sofia, Moniz, Helena, Coheur, Luísa

Aug-14-2024–arXiv.org Artificial Intelligence

Human biases have been shown to influence the performance of models and algorithms in various fields, including Natural Language Processing. While the study of this phenomenon has gained attention in recent years, the available resources remain relatively scarce and often focus on different forms or manifestations of bias. The aim of our work is twofold: 1) to gather publicly available datasets and determine how best to combine them to effectively train models for hate speech detection and classification; 2) to analyse the main issues with these datasets, such as scarcity, skewed resources, and reliance on non-persistent data. We discuss these issues alongside our experiments, in which we show that the choice of dataset combination greatly impacts the models' performance.

Country:
- Africa > Kenya (0.04)
- North America > United States
  - New York > New York County
    - New York City (0.04)
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
- Europe
  - Serbia (0.04)
  - France (0.04)
  - Eastern Europe (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
  - Italy > Tuscany
    - Florence (0.04)
- Asia
  - China > Hong Kong (0.04)
  - Middle East > Israel (0.04)

Genre:
- Overview (0.93)
- Research Report > New Finding (0.46)

Industry:
- Law (1.00)
- Information Technology (0.93)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
- Health & Medicine > Therapeutic Area (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.68)
