A Group-Specific Approach to NLP for Hate Speech Detection
–arXiv.org Artificial Intelligence
Automatic hate speech detection is an important yet complex task, requiring knowledge of common sense, stereotypes of protected groups, and histories of discrimination, each of which may constantly evolve. In this paper, we propose a group-specific approach to NLP for online hate speech detection. The approach consists of creating and infusing historical and linguistic knowledge about a particular protected group into hate speech detection models, analyzing historical data about discrimination against a protected group to better predict spikes in hate speech against that group, and critically evaluating hate speech detection models through lenses of intersectionality and ethics. We demonstrate this approach through a case study on NLP for detection of antisemitic hate speech. The case study synthesizes the current English-language literature on NLP for antisemitism detection, introduces a novel knowledge graph of antisemitic history and language from the 20th century to the present, infuses information from the knowledge graph into a set of tweets over Logistic Regression and uncased DistilBERT baselines, and suggests that incorporating context from the knowledge graph can help models pick up subtle stereotypes.
arXiv.org Artificial Intelligence
Apr-21-2023
- Country:
- Asia
- China > Hong Kong (0.04)
- Middle East > Israel
- Jerusalem District > Jerusalem (0.04)
- Europe
- Austria > Vienna (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Greece (0.04)
- Hungary (0.04)
- Italy > Tuscany
- Florence (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- District of Columbia > Washington (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.04)
- Canada > Quebec
- Asia
- Genre:
- Research Report
- Experimental Study (0.34)
- New Finding (0.48)
- Research Report
- Industry:
- Government > Regional Government (0.46)
- Law > Civil Rights & Constitutional Law (0.47)
- Media (0.93)
- Technology: