Measuring Harmful Representations in Scandinavian Language Models

Nov-21-2022, arXiv.org Artificial Intelligence

Scandinavian countries are perceived as role models when it comes to gender equality. With the advent of pre-trained language models and their widespread use, we investigate to what extent gender-based harmful and toxic content exists in selected Scandinavian language models. We examine nine models, covering Danish, Swedish, and Norwegian, by manually creating template-based sentences and probing the models for completions. We evaluate the completions using two methods for measuring harmful and toxic content and provide a thorough analysis of the results. We show that Scandinavian pre-trained language models contain harmful, gender-based stereotypes, with similar values across all three languages. This finding runs counter to general expectations about gender equality in Scandinavian countries and shows the potentially problematic outcomes of using such models in real-world settings.
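
The template-based probing described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the templates, the subject terms, the word list, and the `toy_complete` stand-in for a language model are all hypothetical. The paper's actual templates, models, and two scoring methods are not reproduced here; a simple lexicon lookup is swapped in as the scoring step.

```python
from typing import Callable, Dict, List

# Hypothetical templates and subject terms (not the paper's own).
TEMPLATES = ["{subject} works as a [MASK].", "{subject} is known as a [MASK]."]
SUBJECTS = {"female": ["The woman", "The girl"], "male": ["The man", "The boy"]}

# Hypothetical lexicon of words treated as harmful in this illustration.
HARMFUL_WORDS = {"idiot", "criminal"}


def build_prompts(templates: List[str],
                  subjects: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Fill every template with every subject term, grouped by gender."""
    return {
        gender: [t.format(subject=term) for term in terms for t in templates]
        for gender, terms in subjects.items()
    }


def harmful_rate(prompts: Dict[str, List[str]],
                 complete: Callable[[str, int], List[str]],
                 top_k: int = 3) -> Dict[str, float]:
    """Share of the top-k completions per gender that appear in the lexicon."""
    rates = {}
    for gender, sentences in prompts.items():
        words = [w.lower() for s in sentences for w in complete(s, top_k)]
        hits = sum(w in HARMFUL_WORDS for w in words)
        rates[gender] = hits / len(words) if words else 0.0
    return rates


def toy_complete(sentence: str, top_k: int) -> List[str]:
    """Stand-in for a masked language model; returns fixed candidate words."""
    if sentence.startswith("The man"):
        candidates = ["teacher", "criminal", "nurse"]
    else:
        candidates = ["teacher", "nurse", "doctor"]
    return candidates[:top_k]


if __name__ == "__main__":
    prompts = build_prompts(TEMPLATES, SUBJECTS)
    print(harmful_rate(prompts, toy_complete))
```

In a real probe, `toy_complete` would be replaced by a call to the model under test, and the per-gender rates would be compared across models and languages.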

artificial intelligence, computational linguistic, natural language

Country:
- North America > United States
  - Washington > King County
    - Seattle (0.04)
  - New York > New York County
    - New York City (0.04)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
- Europe
  - Denmark (0.14)
  - Iceland > Capital Region
    - Reykjavik (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
  - Finland > Southwest Finland
    - Turku (0.04)
  - Italy
    - Tuscany > Florence (0.04)
    - Lombardy > Milan (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Sweden > Östergötland County
    - Linköping (0.05)
  - Norway > Western Norway
    - Vestland > Bergen (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
- Asia > Middle East
  - Israel (0.04)

Genre:
- Overview (0.46)
- Research Report (0.40)

Industry:
- Law > Civil Rights & Constitutional Law (0.70)

Technology:
- Information Technology > Artificial Intelligence > Natural Language (1.00)
