Promoting Fairness and Diversity in Speech Datasets for Mental Health and Neurological Disorders Research
Mancini, Eleonora, Tanevska, Ana, Galassi, Andrea, Galatolo, Alessio, Ruggeri, Federico, Torroni, Paolo
–arXiv.org Artificial Intelligence
Current research in machine learning and artificial intelligence is largely centered on modeling and performance evaluation, less so on data collection. However, recent research demonstrated that limitations and biases in data may negatively impact trustworthiness and reliability. These aspects are particularly impactful on sensitive domains such as mental health and neurological disorders, where speech data are used to develop AI applications aimed at improving the health of patients and supporting healthcare providers. In this paper, we chart the landscape of available speech datasets for this domain, to highlight possible pitfalls and opportunities for improvement and promote fairness and diversity. We present a comprehensive list of desiderata for building speech datasets for mental health and neurological disorders and distill it into a checklist focused on ethical concerns to foster more responsible research.
arXiv.org Artificial Intelligence
Jun-6-2024
- Country:
- Asia (0.67)
- Europe (1.00)
- North America > United States
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Pennsylvania (0.14)
- Minnesota > Hennepin County
- Genre:
- Overview (1.00)
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine > Therapeutic Area
- Neurology (1.00)
- Psychiatry/Psychology > Mental Health (0.93)
- Health & Medicine > Therapeutic Area
- Technology:
- Information Technology
- Artificial Intelligence
- Cognitive Science (0.93)
- Issues > Social & Ethical Issues (0.93)
- Machine Learning (1.00)
- Natural Language (1.00)
- Speech (0.93)
- Communications > Social Media (0.94)
- Data Science > Data Mining (1.00)
- Artificial Intelligence
- Information Technology