FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models
–Neural Information Processing Systems
The advent of foundation models (FMs) in healthcare offers unprecedented opportunities to enhance medical diagnostics through automated classification and segmentation tasks. However, these models also raise significant concerns about their fairness, especially when applied to diverse and underrepresented populations in healthcare applications. Currently, there is a lack of comprehensive benchmarks, standardized pipelines, and easily adaptable libraries to evaluate and understand the fairness performance of FMs in medical imaging, leading to considerable challenges in formulating and implementing solutions that ensure equitable outcomes across diverse patient populations. To fill this gap, we introduce FairMedFM, a fairness benchmark for FM research in medical imaging. It explores 20 widely used FMs, with various usages such as zero-shot learning, linear probing, parameter-efficient fine-tuning, and prompting in various downstream tasks -- classification and segmentation. Our exhaustive analysis evaluates the fairness performance over different evaluation metrics from multiple perspectives, revealing the existence of bias, varied utility-fairness trade-offs on different FMs, consistent disparities on the same datasets regardless FMs, and limited effectiveness of existing unfairness mitigation methods.
Neural Information Processing Systems
May-27-2025, 16:19:53 GMT
- Industry:
- Health & Medicine
- Diagnostic Medicine > Imaging (0.96)
- Health Care Technology (0.96)
- Health & Medicine
- Technology:
- Information Technology > Artificial Intelligence
- Natural Language (0.61)
- Vision (0.96)
- Information Technology > Artificial Intelligence