Towards countering hate speech and personal attack in social media

Charitidis, Polychronis, Doropoulos, Stavros, Vologiannidis, Stavros, Papastergiou, Ioannis, Karakeva, Sophia

Dec-5-2019–arXiv.org Machine Learning

The damaging effects of hate speech in social media are evident during the last few years, and several organizations, researchers and the social media platforms themselves have tried to harness them without great success. Recently, following the advent of deep learning, several novel approaches appeared in the field of hate speech detection. However, it is apparent that such approaches depend on large-scale datasets in order to exhibit competitive performance. In this paper, we present a novel, publicly available collection of datasets in five different languages, that consists of tweets referring to journalism-related accounts, including high-quality human annotations for hate speech and personal attack. To build the datasets we follow a concise annotation strategy and employ an active learning approach. Additionally, we present a number of state-of-the-art deep learning architectures for hate speech detection and use these datasets to train and evaluate them. Finally, we propose an ensemble model that outperforms all individual models.

dataset, speech, tweet, (15 more...)

arXiv.org Machine Learning

Dec-5-2019

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Western Australia > Perth (0.04)
- North America
  - United States
    - Hawaii (0.04)
    - Texas > Travis County
      - Austin (0.04)
    - New Mexico > Santa Fe County
      - Santa Fe (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - Switzerland > Geneva
    - Geneva (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Greece > Central Macedonia
    - Thessaloniki (0.04)

Genre:
- Research Report > Promising Solution (0.34)

Industry:
- Law > Civil Rights & Constitutional Law (0.87)
- Information Technology
  - Services (1.00)
  - Security & Privacy (0.68)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Artificial Intelligence > Machine Learning
    - Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found