Understanding and Analyzing Inappropriately Targeting Language in Online Discourse: A Comparative Annotation Study
Barbarestani, Baran, Maks, Isa, Vossen, Piek
–arXiv.org Artificial Intelligence
This paper introduces a method for detecting inappropriately targeting language in online conversations by integrating crowd and expert annotations with ChatGPT. We focus on English conversation threads from Reddit, examining comments that target individuals or groups. Our approach involves a comprehensive annotation framework that labels a diverse data set for various target categories and specific target words within the conversational context. We perform a comparative analysis of annotations from human experts, crowd annotators, and ChatGPT, revealing strengths and limitations of each method in recognizing both explicit hate speech and subtler discriminatory language. Our findings highlight the significant role of contextual factors in identifying hate speech and uncover new categories of targeting, such as social belief and body image. We also address the challenges and subjective judgments involved in annotation and the limitations of ChatGPT in grasping nuanced language. This study provides insights for improving automated content moderation strategies to enhance online safety and inclusivity.
arXiv.org Artificial Intelligence
May-23-2025
- Country:
- Africa > South Africa (0.04)
- Asia > Japan
- Honshū > Kansai > Osaka Prefecture > Osaka (0.04)
- Europe
- Finland (0.04)
- France (0.04)
- Ireland (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Spain (0.04)
- United Kingdom (0.04)
- North America
- Canada (0.04)
- United States (0.14)
- Oceania
- Australia (0.04)
- New Zealand (0.04)
- Genre:
- Research Report
- Experimental Study (0.34)
- New Finding (0.34)
- Research Report
- Technology: