Examining Temporal Bias in Abusive Language Detection

Mali Jin, Yida Mu, Diana Maynard, Kalina Bontcheva

Sep-25-2023–arXiv.org Artificial Intelligence

In recent years, researchers have developed a huge variety of machine learning models that can automatically detect abusive language (Mishra et al. 2019; Aurpa, Sadik, and Ahmed 2022; Das and Mukherjee 2023; Alrashidi, Jamal, and Alkhathlan 2023). However, these models may be subject to temporal bias, which can lead to a decrease in the accuracy of abusive language detection models, potentially allowing abusive language to be undetected or falsely detected. Temporal bias arises from differences in populations and ...

Previous work identified temporal bias in an Italian hate speech data set associated with immigrants (Florio et al. 2020). However, they have yet to explore temporal factors affecting predictive performance from a multilingual perspective. In this paper, we explore temporal bias in 5 different abusive data sets that span varying time periods, in 4 languages (English, Spanish, Italian, and Chinese). Specifically, we investigate the following core research questions: RQ1: How does the magnitude of temporal bias vary across different data sets such as language, time span and collection methods?
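To make the idea of temporal bias concrete, the following is a minimal Python sketch of the two ways a dated data set can be divided for evaluation: a random split, where training and test posts come from the same period, and a chronological split, where the model is tested only on posts newer than anything it was trained on. This is not the authors' code. The function names, the record layout (`text`, `date`, `label`), and the toy posts are all assumptions made for illustration.

```python
import random
from datetime import date


def chronological_split(records, train_fraction=0.8):
    """Oldest records go to train, newest to test (hypothetical helper)."""
    ordered = sorted(records, key=lambda r: r["date"])
    cut = int(len(ordered) * train_fraction)
    return ordered[:cut], ordered[cut:]


def random_split(records, train_fraction=0.8, seed=0):
    """Shuffle first, so train and test cover the same time span."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Toy, made-up posts spanning 2018 to 2020 (illustrative data only).
posts = [
    {"text": f"post {i}", "date": date(2018 + i // 4, 1 + i % 4, 1), "label": i % 2}
    for i in range(12)
]

chrono_train, chrono_test = chronological_split(posts, train_fraction=0.75)
rand_train, rand_test = random_split(posts, train_fraction=0.75)

# In the chronological split, every test post is at least as recent
# as the newest training post.
newest_train = max(r["date"] for r in chrono_train)
oldest_test = min(r["date"] for r in chrono_test)
```

Training the same classifier on both training sets and comparing its scores on the two test sets gives one rough estimate of how much performance is lost to the time gap. The choice of classifier and metric is left open here.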

chronological split, detection, temporal bias

Country:
- Africa > Mali (0.04)
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America
  - United States > New York (0.04)
  - Puerto Rico > Peñuelas
    - Peñuelas (0.04)
- Europe
  - United Kingdom > England
    - South Yorkshire > Sheffield (0.04)
  - Italy > Tuscany
    - Florence (0.04)
- Asia > Middle East
  - Palestine (0.14)
  - Syria (0.04)
  - Israel (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.88)
- Law > Civil Rights & Constitutional Law (0.68)
- Government
  - Regional Government (0.66)
  - Immigration & Customs (0.54)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Artificial Intelligence
    - Natural Language > Text Processing (0.68)
    - Machine Learning > Neural Networks (0.68)
