Specializing General-purpose LLM Embeddings for Implicit Hate Speech Detection across Datasets
Cheremetiev, Vassiliy, Ngo, Quang Long Ho, Kot, Chau Ying, Baia, Alina Elena, Cavallaro, Andrea
–arXiv.org Artificial Intelligence
Implicit hate speech (IHS) is indirect language that conveys prejudice or hatred through subtle cues, sarcasm or coded terminology. IHS is challenging to detect as it does not include explicit derogatory or inflammatory words. To address this challenge, task-specific pipelines can be complemented with external knowledge or additional information such as context, emotions and sentiment data. In this paper, we show that, by solely fine-tuning recent general-purpose embedding models based on large language models (LLMs), such as Stella, Jasper, NV-Embed and E5, we achieve state-of-the-art performance. Experiments on multiple IHS datasets show up to 1.10 percentage points improvements for in-dataset, and up to 20.35 percentage points improvements in cross-dataset evaluation, in terms of F1-macro score.
arXiv.org Artificial Intelligence
Aug-29-2025
- Country:
- Asia
- China (0.04)
- India (0.04)
- Middle East
- Israel (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.14)
- Singapore (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy
- Calabria > Catanzaro Province
- Catanzaro (0.04)
- Tuscany > Florence (0.04)
- Calabria > Catanzaro Province
- Spain
- Aragón (0.04)
- Valencian Community > Valencia Province
- Valencia (0.04)
- Switzerland > Vaud
- Lausanne (0.04)
- Belgium > Brussels-Capital Region
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Ontario > Toronto (0.04)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Dominican Republic (0.04)
- United States
- New York > New York County
- New York City (0.04)
- Washington > King County
- Seattle (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Nevada (0.04)
- California > San Diego County
- San Diego (0.04)
- South Carolina (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Georgia > Bibb County
- Macon (0.04)
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- Canada
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Government > Immigration & Customs (0.93)
- Law > Civil Rights & Constitutional Law (1.00)
- Law Enforcement & Public Safety > Terrorism (1.00)
- Technology: