Synthetic medical data generation: state of the art and application to trauma mechanism classification
Doremus, Océane, Guerra-Adames, Ariel, Avalos-Fernandez, Marta, Jouhet, Vianney, Gil-Jardiné, Cédric, Lagarde, Emmanuel
–arXiv.org Artificial Intelligence
Faced with the challenges of patient confidentiality and scientific reproducibility, research on machine learning for health is turning towards the conception of synthetic medical databases. This article presents a brief overview of state-of-the-art machine learning methods for generating synthetic tabular and textual data, focusing their application to the automatic classification of trauma mechanisms, followed by our proposed methodology for generating high-quality, synthetic medical records combining tabular and unstructured text data. 1 Introduction
arXiv.org Artificial Intelligence
Aug-6-2025