The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes

Mar-18-2025, 07:42:33 GMT – Neural Information Processing Systems

This work proposes a new challenge set for multimodal classification, focusing on detecting hate speech in multimodal memes. It is constructed such that unimodal models struggle and only multimodal models can succeed: difficult examples ("benign confounders") are added to the dataset to make it hard to rely on unimodal signals. The task requires subtle reasoning, yet is straightforward to evaluate as a binary classification problem. We provide baseline performance numbers for unimodal models, as well as for multimodal models with various degrees of sophistication. We find that state-of-the-art methods perform poorly compared to humans, illustrating the difficulty of the task and highlighting the challenge that this important problem poses to the community.
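Because the task is framed as binary classification, scoring a model reduces to comparing its per-meme scores against 0/1 labels. The following is a minimal sketch of that evaluation, not the challenge's official scoring code: the abstract does not name its metrics, so accuracy and AUROC are assumed here as typical choices, and the function names, the 0.5 threshold, and the toy labels and scores are all hypothetical.

```python
from typing import Sequence


def accuracy(labels: Sequence[int], scores: Sequence[float],
             threshold: float = 0.5) -> float:
    """Fraction of examples whose thresholded score matches the label.

    The 0.5 threshold is an assumption made for illustration.
    """
    preds = [1 if s >= threshold else 0 for s in scores]
    correct = sum(int(p == y) for p, y in zip(preds, labels))
    return correct / len(labels)


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive (hateful)
    example is scored above a randomly chosen negative (benign) one,
    with ties counted as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one example of each class")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical): 1 = hateful, 0 = benign.
labels = [1, 0, 1, 0]
scores = [0.9, 0.2, 0.4, 0.6]

print(accuracy(labels, scores))  # 0.5
print(auroc(labels, scores))     # 0.75
```

The pairwise AUROC is quadratic in the number of examples, which is fine for a sketch; a rank-based formulation would be preferable for a full dataset.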
