Goto

Collaborating Authors

 misinformation


Over-reliance on chatbots can diminish critical-thinking skills, study finds

The Guardian

TECHNOLOGY IT ARTIFICIAL INTELLIGENCE CHATGPT Illustration picture shows the ChatGPT artificial intelligence software, which generates human-like conversation, Friday 03 February 2023 in Lierde. TECHNOLOGY IT ARTIFICIAL INTELLIGENCE CHATGPT Illustration picture shows the ChatGPT artificial intelligence software, which generates human-like conversation, Friday 03 February 2023 in Lierde. A new study from the Massachusetts Institute of Technology is the latest research to find that relying too much on chatbots can diminish critical-thinking skills, and potentially decrease our ability to discern misinformation for ourselves. As AI tools are becoming more sophisticated and accessible, manipulated images and misleading headlines are becoming more common. AI can be part of the solution, and has proved useful in helping users identify fake content - but there's a cost to using it this way, the new research suggests.


Truth over Tricks: Measuring and Mitigating Shortcut Learning in Misinformation Detection

Neural Information Processing Systems

Misinformation detectors often rely on superficial cues (i.e., shortcuts) that correlate with misinformation in training data but fail to generalize to the diverse and evolving nature of real-world misinformation. This issue is exacerbated by large language models (LLMs), which can easily generate convincing misinformation using simple prompts. We introduce TruthOverTricks, a unified evaluation paradigm for measuring shortcut learning in misinformation detection. TruthOverTricks categorizes shortcut behaviors into intrinsic shortcut induction and extrinsic shortcut injection, and evaluates seven representative detectors across 14 popular benchmarks, along with two new factual misinformation datasets, NQ-Misinfo and Streaming-Misinfo. Empirical results reveal that existing detectors suffer severe performance degradation when exposed to both naturally occurring and adversarially crafted shortcuts. To address this, we propose the Shortcut Mitigation Framework (SMF), an LLM-augmented data augmentation framework that mitigates shortcut reliance through paraphrasing, factual summarization, and sentiment normalization. SMF consistently enhances robustness across 16 benchmarks, forcing models to rely on deeper semantic understanding rather than shortcut cues.


Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models

Neural Information Processing Systems

Vision-Language Models (VLMs) excel in generating textual responses from visual inputs, but their versatility raises security concerns. This study takes the first step in exposing VLMs' susceptibility to data poisoning attacks that can manipulate responses to innocuous, everyday prompts. We introduce Shadowcast, a stealthy data poisoning attack where poison samples are visually indistinguishable from benign images with matching texts. Shadowcast demonstrates effectiveness in two attack types. The first is a traditional Label Attack, tricking VLMs into misidentifying class labels, such as confusing Donald Trump for Joe Biden.


Top AI ethics and policy issues of 2025 and what to expect in 2026

AIHub

This happened as generative and agentic systems became essential in key sectors worldwide. This feature highlights the major AI ethics and policy developments of 2025, and concludes with a forward-looking perspective on the ethical and policy challenges likely to shape 2026.





Elon Musk's Grok AI generates images of 'minors in minimal clothing'

The Guardian

Grok has a history of failing to maintain its safety guardrails and posting misinformation. Grok has a history of failing to maintain its safety guardrails and posting misinformation. Elon Musk's Grok AI generates images of'minors in minimal clothing' Elon Musk's chatbot Grok posted on Friday that lapses in safeguards had led it to generate "images depicting minors in minimal clothing" on social media platform X. The chatbot, a product of Musk's company xAI, has been generating a wave of sexualized images throughout the week in response to user prompts. Screenshots shared by users on X showed Grok's public media tab filled with such images.


Like in past disasters, misinformation spreads online in Aomori quake aftermath

The Japan Times

A damaged concrete pillar supporting the Hachinohe Line in the city of Hachinohe, Aomori Prefecture, on Wednesday. False claims that a powerful earthquake in northern Japan was "human-caused," along with artificial intelligence-generated videos, are spreading rapidly across social media after the quake struck Aomori Prefecture on Monday evening. The earthquake registered an upper 6 on Japan's seismic intensity scale, prompting warnings from the Japan Meteorological Agency (JMA) and the Cabinet Secretariat against the spread of unverified information that could hamper emergency response efforts. Misinformation circulated widely on platforms including X, echoing a pattern seen during previous disasters such as the Noto Peninsula earthquake in January 2024, when false rescue pleas and conspiracy theories also gained traction online. In a time of both misinformation and too much information, quality journalism is more crucial than ever.


Simulating Misinformation Propagation in Social Networks using Large Language Models

arXiv.org Artificial Intelligence

Misinformation on social media thrives on surprise, emotion, and identity-driven reasoning, often amplified through human cognitive biases. To investigate these mechanisms, we model large language model (LLM) personas as synthetic agents that mimic user-level biases, ideological alignments, and trust heuristics. Within this setup, we introduce an auditor--node framework to simulate and analyze how misinformation evolves as it circulates through networks of such agents. News articles are propagated across networks of persona-conditioned LLM nodes, each rewriting received content. A question--answering-based auditor then measures factual fidelity at every step, offering interpretable, claim-level tracking of misinformation drift. We formalize a misinformation index and a misinformation propagation rate to quantify factual degradation across homogeneous and heterogeneous branches of up to 30 sequential rewrites. Experiments with 21 personas across 10 domains reveal that identity- and ideology-based personas act as misinformation accelerators, especially in politics, marketing, and technology. By contrast, expert-driven personas preserve factual stability. Controlled-random branch simulations further show that once early distortions emerge, heterogeneous persona interactions rapidly escalate misinformation to propaganda-level distortion. Our taxonomy of misinformation severity -- spanning factual errors, lies, and propaganda -- connects observed drift to established theories in misinformation studies. These findings demonstrate the dual role of LLMs as both proxies for human-like biases and as auditors capable of tracing information fidelity. The proposed framework provides an interpretable, empirically grounded approach for studying, simulating, and mitigating misinformation diffusion in digital ecosystems.