Chumor 1.0: A Truly Funny and Challenging Chinese Humor Understanding Dataset from Ruo Zhi Ba
He, Ruiqi; He, Yushu; Bai, Longju; Liu, Jiarui; Sun, Zhenjie; Tang, Zenghao; Wang, He; Xia, Hanchen; Deng, Naihao
arXiv.org Artificial Intelligence
Existing humor datasets and evaluations focus predominantly on English, leaving few resources for culturally nuanced humor in non-English languages such as Chinese. To address this gap, we construct Chumor, a dataset sourced from Ruo Zhi Ba (RZB), a Reddit-like Chinese platform dedicated to sharing intellectually challenging and culturally specific jokes. We annotate an explanation for each joke and compare these human explanations against those of two state-of-the-art LLMs, GPT-4o and ERNIE Bot, through A/B testing by native Chinese speakers. Our evaluation shows that Chumor is challenging even for SOTA LLMs, and that the human explanations of Chumor jokes are significantly better than those generated by the LLMs.
Jun-18-2024
- Country:
  - Asia
    - China
    - Middle East > UAE
      - Abu Dhabi Emirate > Abu Dhabi (0.04)
    - Singapore (0.05)
  - Europe
    - Denmark > Capital Region
      - Copenhagen (0.04)
    - France > Provence-Alpes-Côte d'Azur
      - Bouches-du-Rhône > Marseille (0.04)
    - Italy > Tuscany
      - Florence (0.04)
    - Portugal > Lisbon
      - Lisbon (0.04)
    - Slovenia (0.04)
  - North America
    - Canada
      - Alberta
        - Census Division No. 5
          - Kneehill County (0.04)
          - Starland County (0.04)
        - Census Division No. 7 > Stettler County No. 6 (0.04)
        - Census Division No. 8 > Red Deer County (0.04)
      - British Columbia > Metro Vancouver Regional District
        - Vancouver (0.14)
      - Ontario > Toronto (0.04)
    - United States
      - Massachusetts > Suffolk County
        - Boston (0.04)
      - Michigan (0.04)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
      - New York > New York County
        - New York City (0.04)
  - South America > Chile
- Genre:
  - Research Report > New Finding (0.46)