CHASM Unveiling Covert Advertisements on Chinese Social Media
–Neural Information Processing Systems
Current benchmarks for evaluating large language models (LLMs) in social media moderation completely overlook a serious threat: covert advertisements, which disguise themselves as regular posts to deceive and mislead consumers into making purchases, leading to significant ethical and legal concerns. In this paper, we present the CHASM, a first-of-its-kind dataset designed to evaluate the capability of Multimodal Large Language Models (MLLMs) in detecting covert advertisements on social media. CHASM3 is a high-quality, anonymized, manually curated dataset consisting of 4,992 instances, based on real-world scenarios from the Chinese social media platform Rednote. The dataset was collected and annotated under strict privacy protection and quality control protocols. It includes many product experience sharing posts that closely resemble covert advertisements, making the dataset particularly challenging.
Neural Information Processing Systems
Jun-21-2026, 15:31:33 GMT
- Country:
- North America > United States (0.46)
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Research Report
- Industry:
- Media > News (1.00)
- Marketing (1.00)
- Law (1.00)
- Information Technology
- Services (1.00)
- Security & Privacy (1.00)
- Technology: