Navigating Dialectal Bias and Ethical Complexities in Levantine Arabic Hate Speech Detection
Ahmed, Ahmed Haj, Yew, Rui-Jie, Minocher, Xerxes, Venkatasubramanian, Suresh
–arXiv.org Artificial Intelligence
Social media platforms have become central to global communication, yet they also facilitate the spread of hate speech. For underrepresented dialects like Levantine Arabic, detecting hate speech presents unique cultural, ethical, and linguistic challenges. This paper explores the complex sociopolitical and linguistic landscape of Levantine Arabic and critically examines the limitations of current datasets used in hate speech detection. We highlight the scarcity of publicly available, diverse datasets and analyze the consequences of dialectal bias within existing resources. By emphasizing the need for culturally and contextually informed natural language processing (NLP) tools, we advocate for a more nuanced and inclusive approach to hate speech detection in the Arab world.
arXiv.org Artificial Intelligence
Dec-14-2024
- Country:
- Asia
- Middle East
- Israel > Jerusalem District
- Jerusalem (0.05)
- Jordan (0.05)
- Lebanon > Beirut Governorate
- Beirut (0.04)
- Palestine > Gaza Strip
- Gaza Governorate > Gaza (0.05)
- Qatar > Ad-Dawhah
- Doha (0.04)
- Syria
- Aleppo Governorate > Aleppo (0.04)
- Damascus Governorate > Damascus (0.05)
- Idlib Governorate > Idlib (0.04)
- Israel > Jerusalem District
- Singapore (0.04)
- Middle East
- Europe
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Italy > Tuscany
- Florence (0.04)
- France > Provence-Alpes-Côte d'Azur
- North America
- Canada > British Columbia
- United States > California (0.04)
- Asia
- Genre:
- Research Report (0.50)
- Industry:
- Government (1.00)
- Technology: