Synthetic Survival Data Generation for Heart Failure Prognosis Using Deep Generative Models
Puttanawarut, Chanon, Fongsrisin, Natcha, Amornritvanich, Porntep, Looareesuwan, Panu, Ratanatharathorn, Cholatid
–arXiv.org Artificial Intelligence
Background: Heart failure (HF) research is constrained by limited access to large, shareable datasets due to privacy regulations and institutional barriers. Synthetic data generation offers a promising solution to overcome these challenges while preserving patient confidentiality. Methods: We generated synthetic HF datasets from institutional data comprising 12,552 unique patients using five deep learning models: tabular variational autoencoder (TVAE), normalizing flow, ADSGAN, SurvivalGAN, and tabular denoising diffusion probabilistic models (TabDDPM). We comprehensively evaluated synthetic data utility through statistical similarity metrics, survival prediction using machine learning and privacy assessments. Results: SurvivalGAN and TabDDPM demonstrated high fidelity to the original dataset, exhibiting similar variable distributions and survival curves after applying histogram equalization. SurvivalGAN (C-indices: 0.71-0.76) and TVAE (C-indices: 0.73-0.76) achieved the strongest performance in survival prediction evaluation, closely matched real data performance (C-indices: 0.73-0.76). Privacy evaluation confirmed protection against re-identification attacks. Conclusions: Deep learning-based synthetic data generation can produce high-fidelity, privacy-preserving HF datasets suitable for research applications. This publicly available synthetic dataset addresses critical data sharing barriers and provides a valuable resource for advancing HF research and predictive modeling.
arXiv.org Artificial Intelligence
Sep-17-2025
- Country:
- Asia
- China (0.04)
- Pakistan (0.04)
- Thailand
- Bangkok > Bangkok (0.04)
- Samut Prakan > Samut Prakan (0.04)
- Europe > Germany (0.04)
- North America
- Canada > Ontario
- Waterloo Region > Waterloo (0.04)
- United States
- New York (0.04)
- Washington > King County
- Seattle (0.04)
- Canada > Ontario
- Asia
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Research Report
- Industry:
- Technology: