PRISM: Leveraging Prototype Patient Representations with Feature-Missing-Aware Calibration for EHR Data Sparsity Mitigation
Zhu, Yinghao, Wang, Zixiang, He, Long, Xie, Shiyun, Ma, Liantao, Pan, Chengwei
–arXiv.org Artificial Intelligence
Electronic Health Record (EHR) data, while rich in information, often suffers from sparsity, posing significant challenges in predictive modeling. Traditional imputation methods inadequately distinguish between real and imputed data, leading to potential inaccuracies in models. Addressing this, we introduce PRISM, a novel approach that indirectly imputes data through prototype representations of similar patients, thus ensuring denser and more accurate embeddings. PRISM innovates further with a feature confidence learner module, which evaluates the reliability of each feature in light of missing data. Additionally, it incorporates a novel patient similarity metric that accounts for feature confidence, avoiding overreliance on imprecise imputed values. Our extensive experiments on the MIMIC-III and MIMIC-IV datasets demonstrate PRISM's superior performance in predicting in-hospital mortality and 30-day readmission tasks, showcasing its effectiveness in handling EHR data sparsity. For the sake of reproducibility and further research, we have made the code publicly available at https://github.com/yhzhu99/PRISM.
arXiv.org Artificial Intelligence
Jan-25-2024
- Country:
- Asia > China (0.14)
- North America > United States (0.14)
- Genre:
- Research Report > Experimental Study (0.46)
- Industry:
- Health & Medicine
- Diagnostic Medicine (0.68)
- Health Care Technology > Medical Record (0.55)
- Therapeutic Area (1.00)
- Health & Medicine
- Technology: