Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices
Sigurgeirsson, Atli, Ungless, Eddie L.
arXiv.org Artificial Intelligence
Modern voice cloning models claim to be able to capture a diverse range of voices. We test the ability of a typical pipeline to capture the style known colloquially as "gay voice" and notice a homogenisation effect: synthesised speech is rated as sounding significantly "less gay" (by LGBTQ+ participants) than its corresponding ground-truth for speakers with "gay voice", but ratings actually increase for control speakers. Loss of "gay voice" has implications for accessibility. We also find that for speakers with "gay voice", loss of "gay voice" corresponds to lower similarity ratings. However, we caution that improving the ability of such models to synthesise "gay voice" comes with a great number of risks. We use this pipeline as a starting point for a discussion on the ethics of modelling queer voices more broadly. Collecting "clean" queer data has safety and fairness ramifications, and the resulting technology may cause harms from mockery to death.
Jun-11-2024
- Country:
- Asia > South Korea
- Europe
- Germany > Saxony
- Leipzig (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- United Kingdom
- England > Cambridgeshire
- Cambridge (0.04)
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- North America > United States
- New York > New York County > New York City (0.04)
- South America > Colombia
- Meta Department > Villavicencio (0.04)
- Genre:
- Research Report (1.00)
- Industry:
- Health & Medicine (0.93)
- Information Technology > Security & Privacy (0.67)
- Technology: