Representation Bias of Adolescents in AI: A Bilingual, Bicultural Study

Wolfe, Robert, Dangol, Aayushi, Howe, Bill, Hiniker, Alexis

Aug-4-2024–arXiv.org Artificial Intelligence

Popular and news media often portray teenagers with sensationalism, as both a risk to society and at risk from society. As AI begins to absorb some of the epistemic functions of traditional media, we study how teenagers in two countries speaking two languages: 1) are depicted by AI, and 2) how they would prefer to be depicted. Specifically, we study the biases about teenagers learned by static word embeddings (SWEs) and generative language models (GLMs), comparing these with the perspectives of adolescents living in the U.S. and Nepal. We find English-language SWEs associate teenagers with societal problems, and more than 50% of the 1,000 words most associated with teenagers in the pretrained GloVe SWE reflect such problems. Given prompts about teenagers, 30% of outputs from GPT2-XL and 29% from LLaMA-2-7B GLMs discuss societal problems, most commonly violence, but also drug use, mental illness, and sexual taboo. Nepali models, while not free of such associations, are less dominated by social problems. Data from workshops with N=13 U.S. adolescents and N=18 Nepalese adolescents show that AI presentations are disconnected from teenage life, which revolves around activities like school and friendship. Participant ratings of how well 20 trait words describe teens are decorrelated from SWE associations, with Pearson's r=.02, n.s. in English FastText and r=.06, n.s. in GloVe; and r=.06, n.s. in Nepali FastText and r=-.23, n.s. in GloVe. U.S. participants suggested AI could fairly present teens by highlighting diversity, while Nepalese participants centered positivity. Participants were optimistic that, if it learned from adolescents, rather than media sources, AI could help mitigate stereotypes. Our work offers an understanding of the ways SWEs and GLMs misrepresent a developmentally vulnerable group and provides a template for less sensationalized characterization.

adolescent, participant, teenager, (14 more...)

arXiv.org Artificial Intelligence

Aug-4-2024

arXiv.org PDF

Add feedback

Country:
- South America > Colombia (0.04)
- North America > United States
  - California (0.04)
  - Florida (0.04)
  - New York > New York County
    - New York City (0.04)
- Europe
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
  - Russia > Volga Federal District
    - Nizhny Novgorod Oblast > Nizhny Novgorod (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
- Asia
  - Nepal > Bagmati Province
    - Kathmandu District > Kathmandu (0.04)
  - China
    - Hong Kong (0.04)
    - Chongqing Province > Chongqing (0.04)

Genre:
- Research Report (1.00)

Industry:
- Media > News (1.00)
- Education (1.00)
- Health & Medicine > Therapeutic Area
  - Psychiatry/Psychology > Mental Health (0.67)
- Government > Regional Government
  - North America Government > United States Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found