Classification Aware Neural Topic Model and its Application on a New COVID-19 Disinformation Corpus
Song, Xingyi, Petrak, Johann, Jiang, Ye, Singh, Iknoor, Maynard, Diana, Bontcheva, Kalina
The explosion of disinformation related to the COVID-19 pandemic has overloaded fact-checkers and media worldwide. To help tackle this, we developed computational methods to support COVID-19 disinformation debunking and social impacts research. This paper presents: 1) the currently largest available manually annotated COVID-19 disinformation category dataset; and 2) a classification-aware neural topic model (CANTM) that combines classification and topic modelling under a variational autoencoder framework. We demonstrate that CANTM efficiently improves classification performance with low resources, and is scalable. In addition, the classification-aware topics help researchers and end-users to better understand the classification results.
Jun-5-2020
- Country:
- Oceania > Palau (0.04)
- South America
- Brazil (0.04)
- Ecuador > Guayas Province
- Guayaquil (0.04)
- North America > United States
- Europe
- Austria > Vienna (0.14)
- Italy (0.05)
- Croatia (0.04)
- Germany (0.04)
- Spain (0.04)
- Romania (0.04)
- Ireland (0.04)
- Middle East > Malta
- Port Region > Southern Harbour District > Valletta (0.04)
- United Kingdom > England
- South Yorkshire > Sheffield (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Indonesia (0.04)
- Singapore (0.04)
- Philippines (0.04)
- Pakistan (0.04)
- Japan (0.04)
- Middle East
- India
- Maharashtra > Mumbai (0.04)
- Karnataka (0.04)
- Chandigarh (0.04)
- China
- Hubei Province > Wuhan (0.05)
- Beijing > Beijing (0.04)
- Africa
- Middle East > Libya (0.04)
- South Africa > Gauteng
- Johannesburg (0.04)
- Genre:
- Research Report (0.40)
- Industry:
- Technology: