AITopics | ehrshot

ehrshot

EHRSHOT: An EHR Benchmark for Few-Shot Evaluation of Foundation Models

Neural Information Processing Systems | Feb-17-2026, 07:37:37 GMT

We help address these challenges through three contributions. First, we publish a new dataset, EHRSHOT, which contains de-identified structured data from the electronic health records (EHRs) of 6,739 patients from Stanford Medicine.

bioinformatics, data mining, machine learning, (20 more...)

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.05)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report > Experimental Study (0.67)

Industry:

Health & Medicine > Health Care Technology > Medical Record (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
Health & Medicine > Therapeutic Area > Internal Medicine (0.68)

Technology:

Information Technology > Biomedical Informatics (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Information Management (0.88)
(3 more...)

EHRSHOT: An EHR Benchmark for Few-Shot Evaluation of Foundation Models

Neural Information Processing Systems | Dec-26-2025, 21:04:56 GMT

While the general machine learning (ML) community has benefited from public datasets, tasks, and models, the progress of ML in healthcare has been hampered by a lack of such shared assets. The success of foundation models creates new challenges for healthcare ML by requiring access to shared pretrained models to validate performance benefits. We help address these challenges through three contributions. First, we publish a new dataset, EHRSHOT, which contains de-identified structured data from the electronic health records (EHRs) of 6,739 patients from Stanford Medicine. Unlike MIMIC-III/IV and other popular EHR datasets, EHRSHOT is longitudinal and not restricted to ICU/ED patients.

ehr benchmark, ehrshot, few-shot evaluation, (5 more...)

Genre: Research Report > Experimental Study (0.55)

Industry: Health & Medicine > Health Care Technology > Medical Record (0.59)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.39)

Cross-Representation Benchmarking in Time-Series Electronic Health Records for Clinical Outcome Prediction

Chen, Tianyi, Zhu, Mingcheng, Luo, Zhiyao, Zhu, Tingting

arXiv.org Artificial Intelligence | Oct-13-2025

Electronic Health Records (EHRs) enable deep learning for clinical predictions, but the optimal method for representing patient data remains unclear due to inconsistent evaluation practices. We present the first systematic benchmark to compare EHR representation methods, including multivariate time-series, event streams, and textual event streams for LLMs. This benchmark standardises data curation and evaluation across two distinct clinical settings: the MIMIC-IV dataset for ICU tasks (mortality, phenotyping) and the EHRSHOT dataset for longitudinal care (30-day readmission, 1-year pancreatic cancer). For each paradigm, we evaluate appropriate modelling families (Transformers, MLPs, LSTMs and RETAIN for time-series; CLMBR and count-based models for event streams; 8-20B LLMs for textual streams) and analyse the impact of feature pruning based on data missingness. Our experiments reveal that event stream models consistently deliver the strongest performance. Pre-trained models like CLMBR are highly sample-efficient in few-shot settings, though simpler count-based models can be competitive given sufficient data. Furthermore, we find that feature selection strategies must be adapted to the clinical setting: pruning sparse features improves ICU predictions, while retaining them is critical for longitudinal tasks. Our results, enabled by a unified and reproducible pipeline, provide practical guidance for selecting EHR representations based on the clinical context and data regime.
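
The abstract above mentions pruning features based on data missingness but does not say how. As a rough illustration only, one common approach is to drop any feature column whose fraction of missing values exceeds a threshold. The function name `prune_sparse_features`, the `max_missing_fraction` parameter, and the 0.5 default are assumptions made for this sketch and are not taken from the paper.

```python
import math


def _is_missing(value):
    """Treat None and float NaN as missing (an assumption for this sketch)."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def prune_sparse_features(rows, max_missing_fraction=0.5):
    """Drop feature columns whose missing fraction exceeds the threshold.

    rows: list of equal-length lists, one list per patient record.
    Returns (kept_column_indices, pruned_rows).
    Hypothetical helper; not the benchmark's actual pipeline.
    """
    if not rows:
        return [], []
    n_rows = len(rows)
    n_features = len(rows[0])
    kept = []
    for j in range(n_features):
        missing = sum(1 for row in rows if _is_missing(row[j]))
        if missing / n_rows <= max_missing_fraction:
            kept.append(j)
    pruned = [[row[j] for j in kept] for row in rows]
    return kept, pruned


if __name__ == "__main__":
    # Toy records: column 1 is missing in 3 of 4 rows, column 2 in 1 of 4.
    records = [
        [1.0, None, 3.0],
        [2.0, None, None],
        [3.0, 5.0, 6.0],
        [4.0, None, 7.0],
    ]
    kept, pruned = prune_sparse_features(records, max_missing_fraction=0.5)
    print(kept)    # [0, 2]
    print(pruned)  # column 1 removed from every row
```

Given the abstract's finding that sparse features matter for longitudinal tasks, a setup like this would presumably use a permissive threshold (or skip pruning) on EHRSHOT-style data and a stricter one on ICU data; the appropriate values are not stated in the source.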

large language model, machine learning, natural language, (20 more...)

2510.09159

Country: Europe > United Kingdom > England > Oxfordshire > Oxford (0.86)

Genre: