The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
–Neural Information Processing Systems
This work proposes a new challenge set for multimodal classification, focusing on detecting hate speech in multimodal memes. It is constructed such that unimodal models struggle and only multimodal models can succeed: difficult examples ("benign confounders") are added to the dataset to make it hard to rely on unimodal signals. The task requires subtle reasoning, yet is straightforward to evaluate as a binary classification problem. We provide baseline performance numbers for unimodal models, as well as for multimodal models with various degrees of sophistication. We find that state-of-the-art methods perform poorly compared to humans, illustrating the difficulty of the task and highlighting the challenge that this important problem poses to the community.
Neural Information Processing Systems
Mar-18-2025, 07:42:33 GMT
- Country:
- North America (0.46)
- Genre:
- Research Report (0.34)
- Industry:
- Information Technology > Services (1.00)
- Law Enforcement & Public Safety > Terrorism (0.68)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Neural Networks (0.46)
- Pattern Recognition (0.47)
- Natural Language (1.00)
- Representation & Reasoning > Information Fusion (0.46)
- Vision (1.00)
- Machine Learning
- Communications > Social Media (1.00)
- Artificial Intelligence
- Information Technology