Fake News Detection with Different Models

Vijayaraghavan, Sairamvinay, Wang, Ye, Guo, Zhiyuan, Voong, John, Xu, Wenda, Nasseri, Armand, Cai, Jiaru, Li, Linda, Vuong, Kevin, Wadhwa, Eshan

arXiv.org Machine Learning 

Problem: The problem we intend to solve is modelled as a binary classification problem. We intend to find the relation in the words and the context in which the words appear within the text and how it could be used to classify texts as real (negative cases) or fake (positive). High-level description: Many news sources contain false information and are therefore "fake news." Because there is a lot of "fake news" articles and fabricated, misleading information on the web, we would like to determine which texts are legitimate (real) and which are illegitimate (fake). To solve this as a binary classification problem, we investigate the effectiveness of different Natural Language Processing models which are used to convert character based texts into numeric representations such as TFIDF, CountVectorizer and Word2Vec models and find out which model is able to preserve most of the contextual information about the text used in a fake news data set and how helpful and effective it is in detecting whether the text is a fake news or not.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found