Kataria, Saurabh
GPT-PPG: A GPT-based Foundation Model for Photoplethysmography Signals
Chen, Zhaoliang, Ding, Cheng, Kataria, Saurabh, Yan, Runze, Wang, Minxiao, Lee, Randall, Hu, Xiao
This study introduces a novel application of a Generative Pre-trained Transformer (GPT) model tailored for photoplethysmography (PPG) signals, serving as a foundation model for various downstream tasks. Adapting the standard GPT architecture to suit the continuous characteristics of PPG signals, our approach demonstrates promising results. Our models are pre-trained on our extensive dataset that contains more than 200 million 30s PPG samples. We explored different supervised fine-tuning techniques to adapt our model to downstream tasks, resulting in performance comparable to or surpassing current state-of-the-art (SOTA) methods in tasks like atrial fibrillation detection. A standout feature of our GPT model is its inherent capability to perform generative tasks such as signal denoising effectively, without the need for further fine-tuning. This success is attributed to the generative nature of the GPT framework. Keywords: Foundation model, PPG, Generative Pre-trained Transformer 1. Introduction The emergence of large language models (LLMs) such as BERT [1] and GPT [2] has revolutionized the field of artificial intelligence by introducing the concept of foundation models. These models, characterized by extensive pre-training on large datasets without explicit supervision, demonstrate remarkable versatility across downstream tasks via fine-tuning.
Fusion of ECG Foundation Model Embeddings to Improve Early Detection of Acute Coronary Syndromes
Meng, Zeyuan, Panchumarthi, Lovely Yeswanth, Kataria, Saurabh, Fedorov, Alex, Zègre-Hemsey, Jessica, Hu, Xiao, Xiao, Ran
Acute Coronary Syndrome (ACS) is a life - threatening cardiovascular condition where early and accurate diagnosis is critical for effective treatment and improved patient outcomes. This study explores the use of ECG foundation models, specifically ST - MEM and ECG - FM, to enhance ACS risk assessment using prehospital ECG data collected in the ambulances . Both models leverage self - supervised learning (SSL), with ST - MEM using a reconstruction - based approach and ECG - FM employing contrastive learning, capt uring unique spatial and temporal ECG features. We evaluate the performance of these models individually and through a fusion approach, where their embeddings are combined for enhanced prediction. Results demonstrate that both foundation models outperform a baseline ResNet - 50 model, with the fusion - based approach achieving the highest perf ormance (AUROC: 0.843 0.006, AUCPR: 0.674 0.012). These findings highlight the potential of ECG foundation models for early ACS detection and motivate further exploration of advanced fusion strategies to maximize complementary feature utilization.
Continuous Cardiac Arrest Prediction in ICU using PPG Foundation Model
Kataria, Saurabh, Xiao, Ran, Ruchti, Timothy, Clark, Matthew, Lu, Jiaying, Lee, Randall J., Grunwell, Jocelyn, Hu, Xiao
Non-invasive patient monitoring for tracking and predicting adverse acute health events is an emerging area of research. We pursue in-hospital cardiac arrest (IHCA) prediction using only single-channel finger photoplethysmography (PPG) signals. Our proposed two-stage model Feature Extractor-Aggregator Network (FEAN) leverages powerful representations from pre-trained PPG foundation models (PPG-GPT of size up to 1 Billion) stacked with sequential classification models. We propose two FEAN variants ("1H", "FH") which use the latest one-hour and (max) 24-hour history to make decisions respectively. Our study is the first to present IHCA prediction results in ICU patients using only unimodal (continuous PPG signal) waveform deep representations. With our best model, we obtain an average of 0.79 AUROC over 24~h prediction window before CA event onset with our model peaking performance at 0.82 one hour before CA. We also provide a comprehensive analysis of our model through architectural tuning and PaCMAP visualization of patient health trajectory in latent space.
Utilizing Context in Generative Bayesian Models for Linked Corpus
Kataria, Saurabh (Pennsylvania State University) | Mitra, Prasenjit (Pennsylvania State University) | Bhatia, Sumit (Pennsylvania State University)
In an interlinked corpus of documents, the context in which a citation appears provides extra information about the cited document. However, associating terms in the context to the cited document remains an open problem. We propose a novel document generation approach that statistically incorporates the context in which a document links to another document. We quantitatively show that the proposed generation scheme explains the linking phenomenon better than previous approaches. The context information along with the actual content of the document provides significant improvements over the previous approaches for various real world evaluation tasks such as link prediction and log-likelihood estimation on unseen content. The proposed method is more scalable to large collection of documents compared to the previous approaches.