DEPTWEET: A Typology for Social Media Texts to Detect Depression Severities
Kabir, Mohsinul, Ahmed, Tasnim, Hasan, Md. Bakhtiar, Laskar, Md Tahmid Rahman, Joarder, Tarun Kumar, Mahmud, Hasan, Hasan, Kamrul
–arXiv.org Artificial Intelligence
Mental health research through data-driven methods has been hindered by a lack of standard typology and scarcity of adequate data. In this study, we leverage the clinical articulation of depression to build a typology for social media texts for detecting the severity of depression. It emulates the standard clinical assessment procedure Diagnostic and Statistical Manual of Mental Disorders (DSM-5) and Patient Health Questionnaire (PHQ-9) to encompass subtle indications of depressive disorders from tweets. Along with the typology, we present a new dataset of 40191 tweets labeled by expert annotators. Each tweet is labeled as 'non-depressed' or 'depressed'. Moreover, three severity levels are considered for 'depressed' tweets: (1) mild, (2) moderate, and (3) severe. An associated confidence score is provided with each label to validate the quality of annotation. We examine the quality of the dataset via representing summary statistics while setting strong baseline results using attention-based models like BERT and DistilBERT. Finally, we extensively address the limitations of the study to provide directions for further research.
arXiv.org Artificial Intelligence
Oct-10-2022
- Country:
- South America > Chile
- North America
- United States
- Maryland > Baltimore (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California > San Diego County
- San Diego (0.04)
- Colorado > Denver County
- Denver (0.04)
- Arizona > Maricopa County
- Scottsdale (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Canada
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy
- Tuscany > Florence (0.04)
- Calabria > Catanzaro Province
- Catanzaro (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Spain > Catalonia
- Asia
- Middle East > Qatar
- Japan > Honshū
- Tōhoku > Fukushima Prefecture > Fukushima (0.04)
- Bangladesh > Dhaka Division
- Dhaka District > Dhaka (0.04)
- Genre:
- Research Report > New Finding (0.86)
- Industry:
- Technology:
- Information Technology
- Data Science > Data Mining (1.00)
- Communications > Social Media (1.00)
- Artificial Intelligence
- Representation & Reasoning (1.00)
- Natural Language
- Text Processing (0.93)
- Machine Translation (0.67)
- Machine Learning
- Statistical Learning (1.00)
- Performance Analysis > Accuracy (1.00)
- Neural Networks > Deep Learning (1.00)
- Information Technology