LionGuard 2: Building Lightweight, Data-Efficient & Localised Multilingual Content Moderators
Tan, Leanne, Chua, Gabriel, Ge, Ziyu, Lee, Roy Ka-Wei
–arXiv.org Artificial Intelligence
Modern moderation systems increasingly support multiple languages, but often fail to address localisation and low-resource variants - creating safety gaps in real-world deployments. Small models offer a potential alternative to large LLMs, yet still demand considerable data and compute. We present LionGuard 2, a lightweight, multilingual moderation classifier tailored to the Singapore context, supporting English, Chinese, Malay, and partial Tamil. Built on pre-trained OpenAI embeddings and a multi-head ordinal classifier, LionGuard 2 outperforms several commercial and open-source systems across 17 benchmarks, including both Singapore-specific and public English datasets. The system is actively deployed within the Singapore Government, demonstrating practical efficacy at scale. Our findings show that high-quality local data and robust multilingual embeddings can achieve strong moderation performance, without fine-tuning large models. We release our model weights and part of our training data to support future work on LLM safety.
arXiv.org Artificial Intelligence
Sep-30-2025
- Country:
- Asia
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.14)
- Singapore (1.00)
- Thailand > Bangkok
- Bangkok (0.04)
- Middle East > UAE
- North America
- Mexico > Mexico City
- Mexico City (0.04)
- United States (0.04)
- Mexico > Mexico City
- Asia
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology: