Perceptual Implications of Automatic Anonymization in Pathological Speech
Arasteh, Soroosh Tayebi, Afza, Saba, Nguyen, Tri-Thien, Buess, Lukas, Parvin, Maryam, Arias-Vergara, Tomas, Perez-Toro, Paula Andrea, Hung, Hiu Ching, Lotfinia, Mahshad, Gorges, Thomas, Noeth, Elmar, Schuster, Maria, Yang, Seung Hee, Maier, Andreas
–arXiv.org Artificial Intelligence
Automatic anonymization techniques are essential for ethical sharing of pathological speech data, yet their perceptual consequences remain understudied. We present a comprehensive human-centered analysis of anonymized pathological speech, using a structured protocol involving ten native and non-native German listeners with diverse linguistic, clinical, and technical backgrounds. Listeners evaluated anonymized-original utterance pairs from 180 speakers spanning Cleft Lip and Palate, Dysarthria, Dysglossia, Dysphonia, and healthy controls. Speech was anonymized using state-of-the-art automatic methods (equal error rates in the range of 30-40%). Listeners completed Turing-style discrimination and quality rating tasks under zero-shot (single-exposure) and few-shot (repeated-exposure) conditions. Discrimination accuracy was high overall (91% zero-shot; 93% few-shot), but varied by disorder (repeated-measures ANOVA: p=0.007), ranging from 96% (Dysarthria) to 86% (Dysphonia). Anonymization consistently reduced perceived quality across groups (from 83% to 59%, p<0.001), with pathology-specific degradation patterns (one-way ANOVA: p=0.005). Native listeners showed a non-significant trend toward higher original speech ratings (Delta=4%, p=0.199), but this difference was minimal after anonymization (Delta=1%, p=0.724). No significant gender-based bias was observed. Perceptual outcomes did not correlate with automatic metrics; intelligibility was linked to perceived quality in original speech but not after anonymization. These findings underscore the need for listener-informed, disorder-specific anonymization strategies that preserve both privacy and perceptual integrity.
arXiv.org Artificial Intelligence
Aug-25-2025
- Country:
- Asia
- Europe
- Austria > Vienna (0.14)
- Finland (0.04)
- Germany
- Bavaria
- Middle Franconia > Nuremberg (0.04)
- Upper Bavaria > Munich (0.04)
- Hamburg (0.04)
- North Rhine-Westphalia
- Cologne Region > Aachen (0.04)
- Upper Bavaria > Munich (0.04)
- Bavaria
- Greece (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy > Piedmont
- Turin Province > Turin (0.04)
- Netherlands > South Holland
- Rotterdam (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- North America
- Canada > Alberta
- United States
- California
- San Diego County > San Diego (0.04)
- Santa Clara County > Stanford (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Rhode Island (0.04)
- California
- Oceania > Australia
- Queensland > Brisbane (0.04)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Research Report
- Industry:
- Health & Medicine
- Diagnostic Medicine (1.00)
- Therapeutic Area > Neurology (1.00)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Neural Networks > Deep Learning (0.93)
- Performance Analysis > Accuracy (0.87)
- Natural Language (1.00)
- Speech > Speech Recognition (1.00)
- Machine Learning
- Information Technology > Artificial Intelligence