Knowledge Graph informed Fake News Classification via Heterogeneous Representation Ensembles
Koloski, Boshko, Stepišnik-Perdih, Timen, Robnik-Šikonja, Marko, Pollak, Senja, Škrlj, Blaž
–arXiv.org Artificial Intelligence
Increasing amounts of freely available data both in textual and relational form offers exploration of richer document representations, potentially improving the model performance and robustness. An emerging problem in the modern era is fake news detection -- many easily available pieces of information are not necessarily factually correct, and can lead to wrong conclusions or are used for manipulation. In this work we explore how different document representations, ranging from simple symbolic bag-of-words, to contextual, neural language model-based ones can be used for efficient fake news identification. One of the key contributions is a set of novel document representation learning methods based solely on knowledge graphs, i.e. extensive collections of (grounded) subject-predicate-object triplets. We demonstrate that knowledge graph-based representations already achieve competitive performance to conventionally accepted representation learners. Furthermore, when combined with existing, contextual representations, knowledge graph-based document representations can achieve state-of-the-art performance. To our knowledge this is the first larger-scale evaluation of how knowledge graph-based representations can be systematically incorporated into the process of fake news classification.
arXiv.org Artificial Intelligence
Oct-20-2021
- Country:
- Oceania > Australia
- North America
- United States
- Texas (0.04)
- Nevada (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- North Carolina > Wake County
- Raleigh (0.04)
- New York
- Richmond County > New York City (0.04)
- Queens County > New York City (0.04)
- New York County > New York City (0.04)
- Kings County > New York City (0.04)
- Bronx County > New York City (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California
- San Francisco County > San Francisco (0.14)
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- United States
- Europe
- France (0.04)
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.05)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Media > News (1.00)
- Government (1.00)
- Health & Medicine > Therapeutic Area
- Immunology (0.48)
- Technology: