Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content

Sai Kartheek Reddy Kasu, Shankar Biradar, Sunil Saumya

Mar-20-2025–arXiv.org Artificial Intelligence

This paper presents the Deceptive Humor Dataset (DHD), a novel resource for studying humor derived from fabricated claims and misinformation. In an era of rampant misinformation, understanding how humor intertwines with deception is essential. DHD consists of humor-infused comments generated with the ChatGPT-4o model from false narratives that incorporate fabricated claims and manipulated information. Each instance is labeled with a Satire Level, ranging from 1 (subtle satire) to 3 (high-level satire), and is classified into five distinct Humor Categories: Dark Humor, Irony, Social Commentary, Wordplay, and Absurdity. The dataset covers English, Telugu, Hindi, Kannada, and Tamil, along with the code-mixed variants Te-En, Hi-En, Ka-En, and Ta-En, making it a valuable multilingual benchmark. DHD provides a structured basis for analyzing humor in deceptive contexts and opens a research direction that explores how humor not only interacts with misinformation but also influences its perception and spread. We also establish strong baselines on the dataset, against which future deceptive humor detection models can be benchmarked.
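
The labeling scheme described in the abstract can be sketched as a small record type. This is only an illustration under stated assumptions: the names `DHDRecord`, `HumorCategory`, `LANGUAGES`, and every field name are hypothetical and are not taken from the released dataset, whose actual column names and file format are not given here. The sketch assumes one humor category per instance. Only the value ranges (Satire Level 1 to 3, five humor categories, nine language variants) come from the abstract.

```python
from dataclasses import dataclass
from enum import Enum


class HumorCategory(Enum):
    """The five humor categories named in the abstract."""
    DARK_HUMOR = "Dark Humor"
    IRONY = "Irony"
    SOCIAL_COMMENTARY = "Social Commentary"
    WORDPLAY = "Wordplay"
    ABSURDITY = "Absurdity"


# Language variants listed in the abstract (five languages, four code-mixed).
LANGUAGES = (
    "English", "Telugu", "Hindi", "Kannada", "Tamil",
    "Te-En", "Hi-En", "Ka-En", "Ta-En",
)


@dataclass(frozen=True)
class DHDRecord:
    """Hypothetical in-memory form of one DHD instance (field names assumed)."""
    comment: str                   # the generated humor-infused comment
    language: str                  # one of LANGUAGES
    satire_level: int              # 1 = subtle satire ... 3 = high-level satire
    humor_category: HumorCategory  # assumed: exactly one category per instance

    def __post_init__(self):
        # Reject labels outside the ranges stated in the abstract.
        if self.satire_level not in (1, 2, 3):
            raise ValueError(f"satire_level must be 1, 2, or 3, got {self.satire_level}")
        if self.language not in LANGUAGES:
            raise ValueError(f"unknown language variant: {self.language}")


# Usage with placeholder text (not a real dataset entry).
example = DHDRecord(
    comment="<comment text>",
    language="Te-En",
    satire_level=2,
    humor_category=HumorCategory.IRONY,
)
```

Validating labels at construction time is one way to catch malformed rows early when loading the data for the two classification tasks the abstract implies (satire level and humor category).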

large language model, machine learning, natural language

Country:
- Europe (0.04)
- Africa (0.04)
- North America > United States
  - New York (0.04)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
- Asia
  - China (0.05)
  - India
    - Andhra Pradesh (0.04)
    - Tamil Nadu (0.04)
    - Manipur > Imphal (0.04)
    - Karnataka (0.04)

Genre:
- Research Report > New Finding (0.68)

Industry:
- Government (1.00)
- Media > News (0.91)
- Health & Medicine > Therapeutic Area
  - Immunology (0.95)
  - Infections and Infectious Diseases (0.69)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
