Kakarmath, Sujay
HeAR -- Health Acoustic Representations
Baur, Sebastien, Nabulsi, Zaid, Weng, Wei-Hung, Garrison, Jake, Blankemeier, Louis, Fishman, Sam, Chen, Christina, Kakarmath, Sujay, Maimbolwa, Minyoi, Sanjase, Nsala, Shuma, Brian, Matias, Yossi, Corrado, Greg S., Patel, Shwetak, Shetty, Shravya, Prabhakara, Shruthi, Muyoyeta, Monde, Ardila, Diego
Health acoustic sounds such as coughs and breaths are known to contain useful health signals with significant potential for monitoring health and disease, yet are underexplored in the medical machine learning community. The existing deep learning systems for health acoustics are often narrowly trained and evaluated on a single task, which is limited by data and may hinder generalization to other tasks. To mitigate these gaps, we develop HeAR, a scalable self-supervised learning-based deep learning system using masked autoencoders trained on a large dataset of 313 million two-second long audio clips. Through linear probes, we establish HeAR as a state-of-the-art health audio embedding model on a benchmark of 33 health acoustic tasks across 6 datasets. By introducing this work, we hope to enable and accelerate further health acoustics research.
Deploying clinical machine learning? Consider the following...
Lu, Charles, Chang, Ken, Singh, Praveer, Pomerantz, Stuart, Doyle, Sean, Kakarmath, Sujay, Bridge, Christopher, Kalpathy-Cramer, Jayashree
Despite the intense attention and considerable investment into clinical machine learning research, relatively few applications have been deployed at a large-scale in a real-world clinical environment. While research is important in advancing the state-of-the-art, translation is equally important in bringing these techniques and technologies into a position to ultimately impact healthcare. We believe a lack of appreciation for several considerations are a major cause for this discrepancy between expectation and reality. To better characterize a holistic perspective among researchers and practitioners, we survey several practitioners with commercial experience in developing CML for clinical deployment. Using these insights, we identify several main categories of challenges in order to better design and develop clinical machine learning applications.
Predicting Cardiovascular Disease Risk using Photoplethysmography and Deep Learning
Weng, Wei-Hung, Baur, Sebastien, Daswani, Mayank, Chen, Christina, Harrell, Lauren, Kakarmath, Sujay, Jabara, Mariam, Behsaz, Babak, McLean, Cory Y., Matias, Yossi, Corrado, Greg S., Shetty, Shravya, Prabhakara, Shruthi, Liu, Yun, Danaei, Goodarz, Ardila, Diego
Cardiovascular diseases (CVDs) are responsible for a large proportion of premature deaths in low- and middle-income countries. Early CVD detection and intervention is critical in these populations, yet many existing CVD risk scores require a physical examination or lab measurements, which can be challenging in such health systems due to limited accessibility. Here we investigated the potential to use photoplethysmography (PPG), a sensing technology available on most smartphones that can potentially enable large-scale screening at low cost, for CVD risk prediction. We developed a deep learning PPG-based CVD risk score (DLS) to predict the probability of having major adverse cardiovascular events (MACE: non-fatal myocardial infarction, stroke, and cardiovascular death) within ten years, given only age, sex, smoking status and PPG as predictors. We compared the DLS with the office-based refit-WHO score, which adopts the shared predictors from WHO and Globorisk scores (age, sex, smoking status, height, weight and systolic blood pressure) but refitted on the UK Biobank (UKB) cohort. In UKB cohort, DLS's C-statistic (71.1%, 95% CI 69.9-72.4) was non-inferior to office-based refit-WHO score (70.9%, 95% CI 69.7-72.2; non-inferiority margin of 2.5%, p<0.01). The calibration of the DLS was satisfactory, with a 1.8% mean absolute calibration error. Adding DLS features to the office-based score increased the C-statistic by 1.0% (95% CI 0.6-1.4). DLS predicts ten-year MACE risk comparable with the office-based refit-WHO score. It provides a proof-of-concept and suggests the potential of a PPG-based approach strategies for community-based primary prevention in resource-limited regions.