GeHirNet: A Gender-Aware Hierarchical Model for Voice Pathology Classification
Wu, Fan, Zhao, Kaicheng, Fleisch, Elgar, Barata, Filipe
–arXiv.org Artificial Intelligence
AI-based voice analysis shows promise for disease diagnostics, but existing classifiers often fail to accurately identify specific pathologies because of gender-related acoustic variations and the scarcity of data for rare diseases. We propose a novel two-stage framework that first identifies gender-specific pathological patterns using ResNet-50 on Mel spectrograms, then performs gender-conditioned disease classification. We address class imbalance through multi-scale resampling and time warping augmentation. Evaluated on a merged dataset from four public repositories, our two-stage architecture with time warping achieves state-of-the-art performance (97.63\% accuracy, 95.25\% MCC), with a 5\% MCC improvement over single-stage baseline. This work advances voice pathology classification while reducing gender bias through hierarchical modeling of vocal characteristics.
arXiv.org Artificial Intelligence
Aug-5-2025
- Country:
- Europe
- Switzerland (0.29)
- Germany (0.28)
- Europe
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Health & Medicine
- Diagnostic Medicine (1.00)
- Therapeutic Area
- Neurology (1.00)
- Musculoskeletal (0.94)
- Health & Medicine
- Technology: