Stop! In the Name of Flaws: Disentangling Personal Names and Sociodemographic Attributes in NLP
Gautam, Vagrant, Subramonian, Arjun, Lauscher, Anne, Keyes, Os
–arXiv.org Artificial Intelligence
Personal names simultaneously differentiate individuals and categorize them in ways that are important in a given society. While the natural language processing community has thus associated personal names with sociodemographic characteristics in a variety of tasks, researchers have engaged to varying degrees with the established methodological problems in doing so. To guide future work that uses names and sociodemographic characteristics, we provide an overview of relevant research: first, we present an interdisciplinary background on names and naming. We then survey the issues inherent to associating names with sociodemographic attributes, covering problems of validity (e.g., systematic error, construct validity), as well as ethical concerns (e.g., harms, differential impact, cultural insensitivity). Finally, we provide guiding questions along with normative recommendations to avoid validity and ethical pitfalls when dealing with names and sociodemographic characteristics in natural language processing.
arXiv.org Artificial Intelligence
Jul-15-2024
- Country:
- Oceania
- Micronesia (0.04)
- Australia > South Australia
- Adelaide (0.04)
- North America
- Dominican Republic (0.04)
- United States
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- New York City (0.05)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- California > Los Angeles County
- Los Angeles (0.14)
- Texas > Travis County
- Canada
- Quebec > Montreal (0.04)
- Ontario > Toronto (0.04)
- British Columbia (0.04)
- Europe
- Czechia > Prague (0.04)
- Spain (0.04)
- Iceland (0.04)
- Italy (0.04)
- Sweden (0.04)
- Germany
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Cambridgeshire > Cambridge (0.04)
- Switzerland > Basel-City
- Basel (0.04)
- Finland > Pirkanmaa
- Tampere (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Asia
- Singapore (0.04)
- Middle East > Israel (0.04)
- China > Hong Kong (0.04)
- South Korea (0.04)
- India (0.04)
- Japan > Kyūshū & Okinawa
- Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- Oceania
- Genre:
- Overview (1.00)
- Industry:
- Law (1.00)
- Government > Regional Government (0.46)
- Technology: