LTCR: Long-Text Chinese Rumor Detection Dataset
Ma, Ziyang, Liu, Mengsha, Fang, Guian, Shen, Ying
arXiv.org Artificial Intelligence
False information can spread quickly on social media, negatively influencing citizens' behavior and responses to social events. To improve the detection of fake news, especially long texts, which are harder to identify in full, we propose LTCR, a Long-Text Chinese Rumor detection dataset. LTCR provides a valuable resource for accurately detecting misinformation, especially complex fake news related to COVID-19. The dataset consists of 1,729 pieces of real news and 500 pieces of fake news, with average lengths of approximately 230 and 152 characters, respectively. We also propose a Salience-aware Fake News Detection Model, which achieves the highest accuracy (95.85%), fake news recall (90.91%), and F-score (90.60%) on the dataset. (https://github.com/Enderfga/DoubleCheck)
Jun-13-2023
- Country:
- Asia
- China
- Fujian Province > Fuzhou (0.04)
- Hubei Province > Wuhan (0.05)
- Pakistan > Islamabad Capital Territory
- Islamabad (0.04)
- Europe
- Germany (0.04)
- Middle East > Cyprus (0.04)
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine > Therapeutic Area
- Immunology (1.00)
- Infections and Infectious Diseases (1.00)
- Media > News (1.00)
- Technology: