Promoting Fairness and Diversity in Speech Datasets for Mental Health and Neurological Disorders Research

Mancini, Eleonora, Tanevska, Ana, Galassi, Andrea, Galatolo, Alessio, Ruggeri, Federico, Torroni, Paolo

Jun-6-2024–arXiv.org Artificial Intelligence

Current research in machine learning and artificial intelligence is largely centered on modeling and performance evaluation, less so on data collection. However, recent research demonstrated that limitations and biases in data may negatively impact trustworthiness and reliability. These aspects are particularly impactful on sensitive domains such as mental health and neurological disorders, where speech data are used to develop AI applications aimed at improving the health of patients and supporting healthcare providers. In this paper, we chart the landscape of available speech datasets for this domain, to highlight possible pitfalls and opportunities for improvement and promote fairness and diversity. We present a comprehensive list of desiderata for building speech datasets for mental health and neurological disorders and distill it into a checklist focused on ethical concerns to foster more responsible research.

dataset, discourse genre, information, (11 more...)

arXiv.org Artificial Intelligence

Jun-6-2024

arXiv.org PDF

Add feedback

Country:
- South America > Colombia
  - Meta Department > Villavicencio (0.04)
- North America
  - United States
    - Pennsylvania > Allegheny County
      - Pittsburgh (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Illinois > Cook County
      - Chicago (0.04)
    - Georgia > Fulton County
      - Atlanta (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - Slovenia (0.04)
  - Sweden > Uppsala County
    - Uppsala (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Italy > Emilia-Romagna
    - Metropolitan City of Bologna > Bologna (0.05)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - Iceland > Capital Region
    - Reykjavik (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
- Asia
  - Singapore (0.04)
  - South Korea
    - Seoul > Seoul (0.04)
    - Incheon > Incheon (0.04)
  - China > Guangdong Province
    - Shenzhen (0.04)

Genre:
- Research Report > New Finding (1.00)
- Overview (1.00)

Industry:
- Health & Medicine > Therapeutic Area
  - Neurology (1.00)
  - Psychiatry/Psychology > Mental Health (0.93)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Communications > Social Media (0.94)
  - Artificial Intelligence
    - Natural Language (1.00)
    - Machine Learning (1.00)
    - Speech (0.93)
    - Issues > Social & Ethical Issues (0.93)
    - Cognitive Science (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found