Explaining GPT-4's Schema of Depression Using Machine Behavior Analysis
Ganesan, Adithya V, Varadarajan, Vasudha, Lal, Yash Kumar, Eijsbroek, Veerle C., Kjell, Katarina, Kjell, Oscar N. E., Dhanasekaran, Tanuja, Stade, Elizabeth C., Eichstaedt, Johannes C., Boyd, Ryan L., Schwartz, H. Andrew, Flek, Lucie
–arXiv.org Artificial Intelligence
Use of large language models such as ChatGPT (GPT-4) for mental health support has grown rapidly, emerging as a promising route to assess and help people with mood disorders, like depression. However, we have a limited understanding of GPT-4's schema of mental disorders, that is, how it internally associates and interprets symptoms. In this work, we leveraged contemporary measurement theory to decode how GPT-4 interrelates depressive symptoms to inform both clinical utility and theoretical understanding. We found GPT-4's assessment of depression: (a) had high overall convergent validity (r = .71 with self-report on 955 samples, and r = .81 with experts judgments on 209 samples); (b) had moderately high internal consistency (symptom inter-correlates r = .23 to .78 ) that largely aligned with literature and self-report; except that GPT-4 (c) underemphasized suicidality's -- and overemphasized psychomotor's -- relationship with other symptoms, and (d) had symptom inference patterns that suggest nuanced hypotheses (e.g. sleep and fatigue are influenced by most other symptoms while feelings of worthlessness/guilt is mostly influenced by depressed mood).
arXiv.org Artificial Intelligence
Nov-20-2024
- Country:
- Europe > Germany (0.67)
- North America > United States
- Minnesota > Hennepin County > Minneapolis (0.14)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.94)
- Workflow (1.00)
- Research Report
- Industry:
- Technology: