AITopics

2412.10467

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
Europe > Greece > Central Macedonia > Thessaloniki (0.04)
(3 more...)

Genre: Research Report > New Finding (0.68)

Industry: Media > News (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Sánchez-Cortés, Dairazalia, Burdisso, Sergio, Villatoro-Tello, Esaú, Motlicek, Petr

Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions

arXiv.org Artificial IntelligenceOct-23-2024

Bias assessment of news sources is paramount for professionals, organizations, and researchers who rely on truthful evidence for information gathering and reporting. While certain bias indicators are discernible from content analysis, descriptors like political bias and fake news pose greater challenges. In this paper, we propose an extension to a recently presented news media reliability estimation method that focuses on modeling outlets and their longitudinal web interactions. Concretely, we assess the classification performance of four reinforcement learning strategies on a large news media hyperlink graph. Our experiments, targeting two challenging bias descriptors, factual reporting and political bias, showed a significant performance improvement at the source media level. Additionally, we validate our methods on the CLEF 2023 CheckThat! Lab challenge, outperforming the reported results in both, F1-score and the official MAE metric. Furthermore, we contribute by releasing the largest annotated dataset of news source media, categorized with factual reporting and political bias labels. Our findings suggest that profiling news media sources based on their hyperlink interactions over time is feasible, offering a bird's-eye view of evolving media landscapes.

information retrieval, machine learning, reinforcement learning, (18 more...)

doi: 10.1007/978-3-031-71736-9_7

2410.17655

Country:

Europe > Switzerland (0.04)
Europe > Czechia > South Moravian Region > Brno (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Media > News (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.37)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.34)

Lei, Yuanyuan, Huang, Ruihong

Identifying Conspiracy Theories News based on Event Relation Graph

arXiv.org Artificial IntelligenceOct-27-2023

Conspiracy theories, as a type of misinformation, are narratives that explains an event or situation in an irrational or malicious manner. While most previous work examined conspiracy theory in social media short texts, limited attention was put on such misinformation in long news documents. In this paper, we aim to identify whether a news article contains conspiracy theories. We observe that a conspiracy story can be made up by mixing uncorrelated events together, or by presenting an unusual distribution of relations between events. Achieving a contextualized understanding of events in a story is essential for detecting conspiracy theories. Thus, we propose to incorporate an event relation graph for each article, in which events are nodes, and four common types of event relations, coreference, temporal, causal, and subevent relations, are considered as edges. Then, we integrate the event relation graph into conspiracy theory identification in two ways: an event-aware language model is developed to augment the basic language model with the knowledge of events and event relations via soft labels; further, a heterogeneous graph attention network is designed to derive a graph embedding based on hard labels. Experiments on a large benchmark dataset show that our approach based on event relation graph improves both precision and recall of conspiracy theory identification, and generalizes well for new unseen media sources.

conspiracy theory, event relation graph, relation, (10 more...)

2310.18545

Country:

North America > United States > Texas > Brazos County > College Station (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
North America > United States > New Mexico > Santa Fe County > Santa Fe (0.04)
(11 more...)

Genre: Research Report (0.82)

Industry:

Media > News (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.47)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

arXiv.org Artificial IntelligenceSep-8-2023

NewB: 200,000+ Sentences for Political Bias Detection

Wei, Jerry

We present the Newspaper Bias Dataset (NewB), a text corpus of more than 200,000 sentences from eleven news sources regarding Donald Trump. While previous datasets have labeled sentences as either liberal or conservative, NewB covers the political views of eleven popular media sources, capturing more nuanced political viewpoints than a traditional binary classification system does. We train two state-of-the-art deep learning models to predict the news source of a given sentence from eleven newspapers and find that a recurrent neural network achieved top-1, top-3, and top-5 accuracies of 33.3%, 61.4%, and 77.6%, respectively, significantly outperforming a baseline logistic regression model's accuracies of 18.3%, 42.6%, and 60.8%. Using the news source label of sentences, we analyze the top n-grams with our model to gain meaningful insight into the portrayal of Trump by media sources.We hope that the public release of our dataset will encourage further research in using natural language processing to analyze more complex political biases. Our dataset is posted at https://github.com/JerryWeiAI/NewB .

dataset, neural network, news source, (15 more...)

2006.03051

Country:

North America > United States > New York (0.05)
North America > United States > Illinois > Cook County > Chicago (0.05)
North America > United States > California > Los Angeles County > Los Angeles (0.05)
(4 more...)

Genre: Research Report > New Finding (0.35)

Industry:

Media > News (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Krstovski, Kriste, Ryu, Angela Soomin, Kogut, Bruce

Evons: A Dataset for Fake and Real News Virality Analysis and Prediction

arXiv.org Artificial IntelligenceSep-16-2022

We present a novel collection of news articles originating from fake and real news media sources for the analysis and prediction of news virality. Unlike existing fake news datasets which either contain claims or news article headline and body, in this collection each article is supported with a Facebook engagement count which we consider as an indicator of the article virality. In addition we also provide the article description and thumbnail image with which the article was shared on Facebook. These images were automatically annotated with object tags and color attributes. Using cloud based vision analysis tools, thumbnail images were also analyzed for faces and detected faces were annotated with facial attributes. We empirically investigate the use of this collection on an example task of article virality prediction.

artificial intelligence, machine learning, social media, (16 more...)

2209.08129

Country: North America > United States (0.15)

Genre: Research Report (0.64)

Industry:

Media > News (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.47)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Daily Mail - Science & techJun-7-2022, 15:19:09 GMT

Social media misinformation threatens 'scientific credibility', report says

Britons' trust in science is at an all-time high after the Covid pandemic, a new report reveals – but misinformation on social media continues to present a'threat to scientific credibility'. The 3M State of Science Index, published on Tuesday, reveals that 90 per cent of UK residents trust science in 2022, compared with 85 per cent in 2019. This stat also compares with 88 per cent of Europeans and 89 per cent of people globally who trust science in 2022. In the UK, 57 per cent of Brits say they are now more appreciative of science after the pandemic, likely due to the efforts of scientists in creating Covid vaccines. However, misinformation'is widespread' on social media and threatens the future of the public's understanding of science, the report says.

science-based fact, scientist, social media, (12 more...)

Daily Mail - Science & tech

Country:

Europe > United Kingdom (0.73)
South America > Brazil (0.05)
Oceania > Australia (0.05)
(9 more...)

Genre: Research Report (0.88)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (0.72)

D'Alonzo, Samantha, Tegmark, Max

Machine-Learning media bias

arXiv.org Artificial IntelligenceAug-31-2021

We present an automated method for measuring media bias. Inferring which newspaper published a given article, based only on the frequencies with which it uses different phrases, leads to a conditional probability distribution whose analysis lets us automatically map newspapers and phrases into a bias space. By analyzing roughly a million articles from roughly a hundred newspapers for bias in dozens of news topics, our method maps newspapers into a two-dimensional bias landscape that agrees well with previous bias classifications based on human judgement. One dimension can be interpreted as traditional left-right bias, the other as establishment bias. This means that although news bias is inherently political, its measurement need not be.

artificial intelligence, machine learning, natural language, (18 more...)

doi: 10.1371/journal.pone.0271947

2109.00024

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.34)

Gao, Yue, Raskutti, Garvesh

Improved Prediction and Network Estimation Using the Monotone Single Index Multi-variate Autoregressive Model

arXiv.org Machine LearningJun-28-2021

Network estimation from multi-variate point process or time series data is a problem of fundamental importance. Prior work has focused on parametric approaches that require a known parametric model, which makes estimation procedures less robust to model mis-specification, non-linearities and heterogeneities. In this paper, we develop a semi-parametric approach based on the monotone single-index multi-variate autoregressive model (SIMAM) which addresses these challenges. We provide theoretical guarantees for dependent data and an alternating projected gradient descent algorithm. Significantly we do not explicitly assume mixing conditions on the process (although we do require conditions analogous to restricted strong convexity) and we achieve rates of the form $O(T^{-\frac{1}{3}} \sqrt{s\log(TM)})$ (optimal in the independent design case) where $s$ is the threshold for the maximum in-degree of the network that indicates the sparsity level, $M$ is the number of actors and $T$ is the number of time points. In addition, we demonstrate the superior performance both on simulated data and two real data examples where our SIMAM approach out-performs state-of-the-art parametric methods both in terms of prediction and network estimation.

improved prediction and network estimation, simam, time sery, (12 more...)

2106.1463

Country:

North America > United States > Wisconsin > Dane County > Madison (0.14)
North America > United States > Illinois > Cook County > Chicago (0.05)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry: Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Zheng, Lili, Raskutti, Garvesh, Willett, Rebecca, Mark, Benjamin

Context-dependent self-exciting point processes: models, methods, and risk bounds in high dimensions

arXiv.org Machine LearningMar-16-2020

High-dimensional autoregressive point processes model how current events trigger or inhibit future events, such as activity by one member of a social network can affect the future activity of his or her neighbors. While past work has focused on estimating the underlying network structure based solely on the times at which events occur on each node of the network, this paper examines the more nuanced problem of estimating context-dependent networks that reflect how features associated with an event (such as the content of a social media post) modulate the strength of influences among nodes. Specifically, we leverage ideas from compositional time series and regularization methods in machine learning to conduct network estimation for high-dimensional marked point processes. Two models and corresponding estimators are considered in detail: an autoregressive multinomial model suited to categorical marks and a logistic-normal model suited to marks with mixed membership in different categories. Importantly, the logistic-normal model leads to a convex negative log-likelihood objective and captures dependence across categories. We provide theoretical guarantees for both estimators, which we validate by simulations and a synthetic data-generating model. We further validate our methods through two real data examples and demonstrate the advantages and disadvantages of both approaches.

bern, category, media source, (17 more...)

2003.07429

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > New York (0.04)
Asia > Middle East > Jordan (0.04)
(2 more...)

Genre: Research Report (0.69)

Industry:

Media > News (1.00)
Government (0.93)
Information Technology (0.66)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.92)
(2 more...)

arXiv.org Machine LearningJun-7-2019

FAKTA: An Automatic End-to-End Fact Checking System

Nadeem, Moin, Fang, Wei, Xu, Brian, Mohtarami, Mitra, Glass, James

Then, the stance detection component detects the With the rapid increase of fake news in social media stance/perspective of each relevant document with and its negative influence on people and public respect to the claim, typically modeled using labels opinion (Mihaylov et al., 2015; Mihaylov and such as agree, disagree and discuss. This Nakov, 2016; Vosoughi et al., 2018), various organizations component further provides rationales at the sentence are now performing manual fact checking level for explaining model predictions (see on suspicious claims.

artificial intelligence, machine learning, natural language, (16 more...)

1906.04164

Country:

Europe (1.00)
Asia (0.68)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)

Genre: Research Report (0.50)

Industry: Media > News (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)