An Investigation of Memorization Risk in Healthcare Foundation Models

Jun-23-2026, 05:32:13 GMT–Neural Information Processing Systems

Foundation models trained on large-scale de-identified electronic health records (EHRs) hold promise for clinical applications. However, their capacity to memorize patient information raises important privacy concerns. In this work, we introduce a suite of black-box evaluation tests to assess privacy-related memorization risks in foundation models trained on structured EHR data. Our framework includes methods for probing memorization at both the embedding and generative levels, and aims to distinguish between model generalization and harmful memorization in clinically relevant settings. We contextualize memorization in terms of its potential to compromise patient privacy, particularly for vulnerable subgroups.

information, large language model, machine learning, (19 more...)

Neural Information Processing Systems

Jun-23-2026, 05:32:13 GMT

Conferences PDF

Add feedback

Country:
- North America > Canada > Ontario (0.28)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Information Technology > Security & Privacy (1.00)
- Health & Medicine
  - Pharmaceuticals & Biotechnology (1.00)
  - Health Care Technology > Medical Record (1.00)
  - Consumer Health (1.00)
  - Therapeutic Area
    - Psychiatry/Psychology (1.00)
    - Infections and Infectious Diseases (1.00)
    - Immunology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.94)
  - Machine Learning > Memory-Based Learning
    - Rote Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found