Write It Like You See It: Detectable Differences in Clinical Notes By Race Lead To Differential Model Recommendations
Adam, Hammaad, Yang, Ming Ying, Cato, Kenrick, Baldini, Ioana, Senteio, Charles, Celi, Leo Anthony, Zeng, Jiaming, Singh, Moninder, Ghassemi, Marzyeh
–arXiv.org Artificial Intelligence
Clinical notes are becoming an increasingly important data source for machine learning (ML) applications in healthcare. Prior research has shown that deploying ML models can perpetuate existing biases against racial minorities, as bias can be implicitly embedded in data. In this study, we investigate the level of implicit race information available to ML models and human experts and the implications of model-detectable differences in clinical notes. Our work makes three key contributions. First, we find that models can identify patient self-reported race from clinical notes even when the notes are stripped of explicit indicators of race. Second, we determine that human experts are not able to accurately predict patient race from the same redacted clinical notes. Finally, we demonstrate the potential harm of this implicit information in a simulation study, and show that models trained on these race-redacted clinical notes can still perpetuate existing biases in clinical treatment decisions.
arXiv.org Artificial Intelligence
Nov-1-2022
- Country:
- Africa (0.04)
- North America
- Jamaica (0.04)
- United States
- New York > New York County
- New York City (0.05)
- New Jersey > Middlesex County
- New Brunswick (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.14)
- Florida > Alachua County
- Gainesville (0.04)
- California > San Francisco County
- San Francisco (0.14)
- New York > New York County
- Canada > Ontario
- Toronto (0.04)
- Europe
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Estonia > Harju County
- Tallinn (0.04)
- Spain > Catalonia
- Asia
- Middle East > Israel (0.04)
- India > Karnataka
- Bengaluru (0.04)
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Research Report
- Industry:
- Technology: