A Multi-Level Benchmark for Causal Language Understanding in Social Media Discourse
Ding, Xiaohan, Ping, Kaike, Çarık, Buse, Rho, Eugenia
–arXiv.org Artificial Intelligence
Understanding causal language in informal discourse is a core yet underexplored challenge in NLP. Existing datasets largely focus on explicit causality in structured text, providing limited support for detecting implicit causal expressions, particularly those found in informal, user-generated social media posts. We introduce CausalTalk, a multi-level dataset of five years of Reddit posts (2020-2024) discussing public health related to the COVID-19 pandemic, among which 10120 posts are annotated across four causal tasks: (1) binary causal classification, (2) explicit vs. implicit causality, (3) cause-effect span extraction, and (4) causal gist generation. Annotations comprise both gold-standard labels created by domain experts and silver-standard labels generated by GPT-4o and verified by human annotators. CausalTalk bridges fine-grained causal detection and gist-based reasoning over informal text. It enables benchmarking across both discriminative and generative models, and provides a rich resource for studying causal reasoning in social media contexts.
arXiv.org Artificial Intelligence
Sep-23-2025
- Country:
- Asia
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Singapore (0.04)
- Myanmar > Tanintharyi Region
- Europe
- Spain (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- North America
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- Washington (0.04)
- Massachusetts (0.04)
- Pennsylvania (0.04)
- Colorado (0.04)
- Illinois (0.04)
- Virginia (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Ohio (0.04)
- Michigan (0.04)
- Arizona (0.04)
- Wisconsin (0.04)
- Texas (0.04)
- California > Orange County (0.04)
- Minnesota (0.04)
- Mexico > Mexico City
- Asia
- Genre:
- Research Report (0.82)
- Industry:
- Technology: