LAVA: Language Model Assisted Verbal Autopsy for Cause-of-Death Determination
Chen, Yiqun T., McCormick, Tyler H., Liu, Li, Datta, Abhirup
–arXiv.org Artificial Intelligence
Verbal autopsy (VA) is a critical tool for estimating causes of death in resource-limited settings where medical certification is unavailable. This study presents LA-VA, a proof-of-concept pipeline that combines Large Language Models (LLMs) with traditional algorithmic approaches and embedding-based classification for improved cause-of-death prediction. Using the Population Health Metrics Research Consortium (PHMRC) dataset across three age categories (Adult: 7,580; Child: 1,960; Neonate: 2,438), we evaluate multiple approaches: GPT-5 predictions, LCVA baseline, text embed-dings, and meta-learner ensembles. Our results demonstrate that GPT-5 achieves the highest individual performance with average test site accuracies of 48.6% (Adult), 50.5% (Child), and 53.5% (Neonate), outperforming traditional statistical machine learning baselines by 5-10%. Our findings suggest that simple off-the-shelf LLM-assisted approaches could substantially improve verbal autopsy accuracy, with important implications for global health surveillance in low-resource settings.
arXiv.org Artificial Intelligence
Sep-12-2025
- Country:
- Africa
- Mozambique > Cabo Delgado Province
- Pemba (0.05)
- Tanzania > Dar es Salaam Region
- Dar es Salaam (0.04)
- Mozambique > Cabo Delgado Province
- Asia
- India
- Andhra Pradesh (0.04)
- Uttar Pradesh (0.04)
- Philippines > Visayas
- Central Visayas > Province of Bohol (0.04)
- India
- North America > Mexico
- Mexico City > Mexico City (0.04)
- Africa
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine
- Public Health (1.00)
- Therapeutic Area
- Gastroenterology (0.69)
- Immunology (0.71)
- Infections and Infectious Diseases (1.00)
- Oncology (1.00)
- Pediatrics/Neonatology (1.00)
- Health & Medicine
- Technology: