Team QCRI-MIT at SemEval-2019 Task 4: Propaganda Analysis Meets Hyperpartisan News Detection

Saleh, Abdelrhman, Baly, Ramy, Barrón-Cedeño, Alberto, Martino, Giovanni Da San, Mohtarami, Mitra, Nakov, Preslav, Glass, James

Apr-6-2019–arXiv.org Machine Learning

In this paper, we describe our submission to SemEval-2019 Task 4 on Hyperpartisan News Detection. Our system relies on a variety of engineered features originally used to detect propaganda. This is based on the assumption that biased messages are propagandistic in the sense that they promote a particular political cause or viewpoint. We trained a logistic regression model with features ranging from simple bag-of-words to vocabulary richness and text readability features. Our system achieved 72.9% accuracy on the test data that is annotated manually and 60.8% on the test data that is annotated with distant supervision. Additional experiments showed that significant performance improvements can be achieved with better feature pre-processing.

artificial intelligence, machine learning, natural language, (14 more...)

arXiv.org Machine Learning

Apr-6-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States (1.00)
- Europe (0.68)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Media > News (1.00)
- Government (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Text Processing (0.67)
  - Machine Learning > Statistical Learning
    - Regression (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found