Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models

Yuxuan, Cao, Jiayang, Wu, Chuen, Alistair Cheong Liang, Guanrong, Bryan Shan, Jen, Theodore Lee Chong, Shen, Sherman Chann Zhi

Mar-8-2025–arXiv.org Artificial Intelligence

Traditional online content moderation systems struggle to classify modern multimodal means of communication, such as memes, a highly nuanced and information-dense medium. This task is especially hard in a culturally diverse society like Singapore, where low-resource languages are used and extensive knowledge on local context is needed to interpret online content. We curate a large collection of 112K memes labeled by GPT-4V for fine-tuning a VLM to classify offensive memes in Singapore context. We show the effectiveness of fine-tuned VLMs on our dataset, and propose a pipeline containing OCR, translation and a 7-billion parameter-class VLM. Our solutions reach 80.62% accuracy and 0.8192 AUROC on a held-out test set, and can greatly aid human in moderating online contents. The dataset, code, and model weights have been open-sourced at https://github.com/aliencaocao/vlm-for-memes-aisg.

dataset, meme, singapore, (11 more...)

arXiv.org Artificial Intelligence

Mar-8-2025

arXiv.org PDF

Add feedback

Country:
- Asia
  - China (0.04)
  - Indonesia > Bali (0.04)
  - Singapore > Central Region
    - Singapore (0.04)
  - Southeast Asia (0.04)
- Europe
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
- North America
  - Dominican Republic (0.04)
  - United States > New York
    - New York County > New York City (0.04)

Genre:
- Research Report (1.00)

Industry:
- Government > Regional Government (0.46)
- Information Technology (0.68)
- Law > Civil Rights & Constitutional Law (0.46)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Neural Networks
      - Deep Learning (0.68)
    - Natural Language > Large Language Model (1.00)
  - Communications > Social Media (1.00)