A Multi-Stage Large Language Model Framework for Extracting Suicide-Related Social Determinants of Health
Wang, Song, Wei, Yishu, Ma, Haotian, Lovitt, Max, Deng, Kelly, Meng, Yuan, Xu, Zihan, Zhang, Jingze, Xiao, Yunyu, Ding, Ying, Xu, Xuhai, Ghosh, Joydeep, Peng, Yifan
arXiv.org Artificial Intelligence
Background: Understanding the social determinants of health (SDoH) factors that contribute to suicide incidents is crucial for early intervention and prevention. However, data-driven approaches to this goal face challenges such as long-tailed factor distributions, difficulty analyzing the pivotal stressors that precede suicide incidents, and limited model explainability. Methods: We present a multi-stage large language model framework to enhance SDoH factor extraction from unstructured text. We compared our approach with other state-of-the-art language models (i.e., pre-trained BioBERT and GPT-3.5-turbo) and reasoning models (i.e., DeepSeek-R1). We also evaluated whether the model's explanations help people annotate SDoH factors more quickly and accurately. The analysis included both automated comparisons and a pilot user study. Results: Our proposed framework improved performance on the overarching task of extracting SDoH factors and on the finer-grained tasks of retrieving relevant context. Additionally, fine-tuning a smaller, task-specific model achieved comparable or better performance at reduced inference cost. The multi-stage design not only enhances extraction but also provides intermediate explanations, improving model explainability. Conclusions: Our approach improves both the accuracy and transparency of extracting suicide-related SDoH from unstructured texts. These advancements have the potential to support early identification of individuals at risk and inform more effective prevention strategies.
Aug-8-2025
- Country:
  - Asia
    - Middle East > Jordan (0.04)
    - Myanmar > Tanintharyi Region
      - Dawei (0.04)
  - North America
    - Puerto Rico (0.04)
    - United States
      - Alaska (0.04)
      - District of Columbia (0.04)
      - Texas > Travis County
        - Austin (0.14)
      - Washington > King County
        - Seattle (0.04)
- Genre:
  - Questionnaire & Opinion Survey (1.00)
  - Research Report
    - Experimental Study (0.46)
    - New Finding (0.46)