What Are They Filtering Out? A Survey of Filtering Strategies for Harm Reduction in Pretraining Datasets

Stranisci, Marco Antonio, Hardmeier, Christian

Feb-17-2025–arXiv.org Artificial Intelligence

Data filtering strategies are a crucial component to develop safe Large Language Models (LLM), since they support the removal of harmful contents from pretraining datasets. There is a lack of research on the actual impact of these strategies on vulnerable groups to discrimination, though, and their effectiveness has not been yet systematically addressed. In this paper we present a benchmark study of data filtering strategies for harm reduction aimed at providing a systematic overview on these approaches. We survey 55 technical reports of English LMs and LLMs to identify the existing filtering strategies in literature and implement an experimental setting to test their impact against vulnerable groups. Our results show that the positive impact that strategies have in reducing harmful contents from documents has the side effect of increasing the underrepresentation of vulnerable groups to discrimination in datasets.

arxiv preprint arxiv, computational linguistic, dataset, (13 more...)

arXiv.org Artificial Intelligence

Feb-17-2025

arXiv.org PDF

Add feedback

Country:
- Oceania > New Zealand
  - North Island > Auckland Region > Auckland (0.04)
- North America
  - United States
    - Virginia (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Florida > Miami-Dade County
      - Miami (0.04)
  - Mexico > Mexico City
    - Mexico City (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - Middle East > Malta (0.04)
  - Italy (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - Finland > Uusimaa
    - Helsinki (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
- Asia
  - Singapore (0.04)
  - Middle East > Jordan (0.04)
  - Thailand > Bangkok
    - Bangkok (0.05)

Genre:
- Research Report > New Finding (0.86)

Industry:
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.61)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found