Classification Aware Neural Topic Model and its Application on a New COVID-19 Disinformation Corpus

Song, Xingyi, Petrak, Johann, Jiang, Ye, Singh, Iknoor, Maynard, Diana, Bontcheva, Kalina

Jun-5-2020–arXiv.org Machine Learning

The explosion of disinformation related to the COVID-19 pandemic has overloaded fact-checkers and media worldwide. To help tackle this, we developed computational methods to support COVID-19 disinformation debunking and social impacts research. This paper presents: 1) the currently largest available manually annotated COVID-19 disinformation category dataset; and 2) a classification-aware neural topic model (CANTM) that combines classification and topic modelling under a variational autoencoder framework. We demonstrate that CANTM efficiently improves classification performance with low resources, and is scalable. In addition, the classification-aware topics help researchers and end-users to better understand the classification results.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

Jun-5-2020

arXiv.org PDF

Add feedback

Country:
- Oceania > Palau (0.04)
- South America
  - Brazil (0.04)
  - Ecuador > Guayas Province
    - Guayaquil (0.04)
- North America > United States
  - Oregon (0.04)
  - Kansas (0.04)
- Europe
  - Austria > Vienna (0.14)
  - Italy (0.05)
  - Croatia (0.04)
  - Germany (0.04)
  - Spain (0.04)
  - Romania (0.04)
  - Ireland (0.04)
  - Middle East > Malta
    - Port Region > Southern Harbour District > Valletta (0.04)
  - United Kingdom > England
    - South Yorkshire > Sheffield (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Indonesia (0.04)
  - Singapore (0.04)
  - Philippines (0.04)
  - Pakistan (0.04)
  - Japan (0.04)
  - Middle East
    - Jordan (0.04)
    - Israel (0.04)
  - India
    - Maharashtra > Mumbai (0.04)
    - Karnataka (0.04)
    - Chandigarh (0.04)
  - China
    - Hubei Province > Wuhan (0.05)
    - Beijing > Beijing (0.04)
- Africa
  - Middle East > Libya (0.04)
  - South Africa > Gauteng
    - Johannesburg (0.04)

Genre:
- Research Report (0.40)

Industry:
- Health & Medicine
  - Epidemiology (1.00)
  - Therapeutic Area
    - Infections and Infectious Diseases (1.00)
    - Immunology (1.00)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning > Neural Networks (1.00)
    - Natural Language > Discourse & Dialogue (0.86)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found