Fake News Detection with Different Models

Vijayaraghavan, Sairamvinay, Wang, Ye, Guo, Zhiyuan, Voong, John, Xu, Wenda, Nasseri, Armand, Cai, Jiaru, Li, Linda, Vuong, Kevin, Wadhwa, Eshan

Feb-15-2020–arXiv.org Machine Learning

Problem: We model fake news detection as a binary classification problem. We aim to find the relation between words and the context in which they appear within a text, and to determine how that relation can be used to classify texts as real (negative cases) or fake (positive cases). High-level description: Many news sources contain false information and are therefore "fake news." Because the web holds many "fake news" articles and much fabricated, misleading information, we would like to determine which texts are legitimate (real) and which are illegitimate (fake). To solve this as a binary classification problem, we investigate the effectiveness of different Natural Language Processing models that convert character-based texts into numeric representations, such as TF-IDF, CountVectorizer, and Word2Vec. We ask which model preserves the most contextual information about the texts in a fake news data set, and how helpful and effective that model is in detecting whether a text is fake news.
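To make the idea of converting text into numeric representations concrete, here is a minimal sketch of two of the representations named above: raw term counts and TF-IDF weights. It is not the authors' code. The toy corpus, the labels, and every function name are hypothetical, and the smoothed IDF formula with L2 normalization is one common variant chosen for illustration. Everything is computed by hand with the Python standard library so the arithmetic is visible.

```python
import math
from collections import Counter

# Hypothetical toy corpus (not from the paper's data set).
# Labels follow the abstract's convention: 1 = fake (positive), 0 = real (negative).
docs = [
    "scientists confirm water is wet",
    "aliens secretly run the city council",
    "city council approves new water budget",
    "secret aliens confirm miracle cure",
]
labels = [0, 1, 0, 1]


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def build_vocabulary(corpus):
    """Sorted list of every distinct token in the corpus."""
    return sorted({tok for doc in corpus for tok in tokenize(doc)})


def count_vector(doc, vocab):
    """Bag-of-words vector: how often each vocabulary term occurs in doc."""
    counts = Counter(tokenize(doc))
    return [counts[term] for term in vocab]


def idf_weights(corpus, vocab):
    """Smoothed inverse document frequency: ln((1 + n) / (1 + df)) + 1."""
    n = len(corpus)
    doc_sets = [set(tokenize(d)) for d in corpus]
    weights = []
    for term in vocab:
        df = sum(term in s for s in doc_sets)  # documents containing the term
        weights.append(math.log((1 + n) / (1 + df)) + 1.0)
    return weights


def tfidf_vector(doc, vocab, idf):
    """Term counts scaled by IDF, then L2-normalized to unit length."""
    raw = [c * w for c, w in zip(count_vector(doc, vocab), idf)]
    norm = math.sqrt(sum(x * x for x in raw))
    return [x / norm for x in raw] if norm else raw


vocab = build_vocabulary(docs)
idf = idf_weights(docs, vocab)
count_matrix = [count_vector(d, vocab) for d in docs]
tfidf_matrix = [tfidf_vector(d, vocab, idf) for d in docs]
```

Each row of `count_matrix` or `tfidf_matrix` pairs with the matching entry of `labels` and could be fed to any binary classifier. Note that both representations discard word order, which is one reason to compare them against an embedding model such as Word2Vec when contextual information matters.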

algorithm, grid search, representation

Country:
- North America > United States > New York (0.04)

Genre:
- Research Report (1.00)

Industry:
- Media > News (1.00)
- Government > Regional Government
  - North America Government > United States Government (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning
    - Statistical Learning (1.00)
    - Neural Networks > Deep Learning (1.00)
