AITopics

doi: 10.35650/AIDP.4111.d.2025

2505.0002

Country:

North America > United States (0.46)
Europe (0.28)

Genre: Research Report > New Finding (0.67)

Industry:

Law (1.00)
Information Technology > Security & Privacy (0.94)
Government (0.93)
Media (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.59)

Anlen, Shirin, Wojciak, Zuzanna

TRIED: Truly Innovative and Effective AI Detection Benchmark, developed by WITNESS

arXiv.org Artificial IntelligenceMay-2-2025

The proliferation of generative AI and deceptive synthetic media threatens the global information ecosystem, especially across the Global Majority. This report from WITNESS highlights the limitations of current AI detection tools, which often underperform in real-world scenarios due to challenges related to explainability, fairness, accessibility, and contextual relevance. In response, WITNESS introduces the Truly Innovative and Effective AI Detection (TRIED) Benchmark, a new framework for evaluating detection tools based on their real-world impact and capacity for innovation. Drawing on frontline experiences, deceptive AI cases, and global consultations, the report outlines how detection tools must evolve to become truly innovative and relevant by meeting diverse linguistic, cultural, and technological contexts. It offers practical guidance for developers, policy actors, and standards bodies to design accountable, transparent, and user-centered detection solutions, and incorporate sociotechnical considerations into future AI standards, procedures and evaluation frameworks. By adopting the TRIED Benchmark, stakeholders can drive innovation, safeguard public trust, strengthen AI literacy, and contribute to a more resilient global information credibility.

artificial intelligence, machine learning, social media, (20 more...)

2504.21489

Country:

North America > United States (1.00)
Europe (1.00)
Asia (1.00)
Africa (1.00)

Genre:

Instructional Material (0.66)
Research Report (0.50)

Industry:

Media (1.00)
Law (1.00)
Information Technology > Security & Privacy (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Applied AI (1.00)
Information Technology > Communications > Social Media (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

WIREDMay-1-2025, 13:56:35 GMT

Think Twice Before Creating That ChatGPT Action Figure

At the start of April, an influx of action figure started appearing on social media sites including LinkedIn and X. Each figure depicted the person who had created it with uncanny accuracy, complete with personalized accessories such as reusable coffee cups, yoga mats, and headphones. All this is possible because of OpenAI's new GPT-4o-powered image generator, which supercharges ChatGPT's ability to edit pictures, render text, and more. OpenAI's ChatGPT image generator can also create pictures in the style of Japanese animated film company Studio Ghibli--a trend that quickly went viral, too. The images are fun and easy to make--all you need is a free ChatGPT account and a photo.

large language model, machine learning, natural language, (11 more...)

WIRED

Country: Europe > United Kingdom (0.06)

Industry:

Media > Film (1.00)
Information Technology > Security & Privacy (1.00)
Leisure & Entertainment (0.92)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.60)

Grebe, Jonas Henry, Braun, Tobias, Rohrbach, Marcus, Rohrbach, Anna

Erased but Not Forgotten: How Backdoors Compromise Concept Erasure

The expansion of large-scale text-to-image diffusion models has raised growing concerns about their potential to generate undesirable or harmful content, ranging from fabricated depictions of public figures to sexually explicit images. To mitigate these risks, prior work has devised machine unlearning techniques that attempt to erase unwanted concepts through fine-tuning. However, in this paper, we introduce a new threat model, Toxic Erasure (ToxE), and demonstrate how recent unlearning algorithms, including those explicitly designed for robustness, can be circumvented through targeted backdoor attacks. The threat is realized by establishing a link between a trigger and the undesired content. Subsequent unlearning attempts fail to erase this link, allowing adversaries to produce harmful content. We instantiate ToxE via two established backdoor attacks: one targeting the text encoder and another manipulating the cross-attention layers. Further, we introduce Deep Intervention Score-based Attack (DISA), a novel, deeper backdoor attack that optimizes the entire U-Net using a score-based objective, improving the attack's persistence across different erasure methods. We evaluate five recent concept erasure methods against our threat model. For celebrity identity erasure, our deep attack circumvents erasure with up to 82% success, averaging 57% across all erasure methods. For explicit content erasure, ToxE attacks can elicit up to 9 times more exposed body parts, with DISA yielding an average increase by a factor of 2.9. These results highlight a critical security gap in current unlearning strategies.

artificial intelligence, machine learning, zhang, (18 more...)

2504.21072

Country:

Europe > Switzerland (0.28)
North America > United States (0.28)

Genre: Research Report (0.40)

Industry:

Information Technology (0.74)
Leisure & Entertainment (0.67)
Government > Regional Government (0.67)
Media > Music (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Wastl, Michelle, Vamvas, Jannis, Calleri, Selena, Sennrich, Rico

20min-XD: A Comparable Corpus of Swiss News Articles

We present 20min-XD (20 Minuten cross-lingual document-level), a French-German, document-level comparable corpus of news articles, sourced from the Swiss online news outlet 20 Minuten/20 minutes. Our dataset comprises around 15,000 article pairs spanning 2015 to 2024, automatically aligned based on semantic similarity. We detail the data collection process and alignment methodology. Furthermore, we provide a qualitative and quantitative analysis of the corpus. The resulting dataset exhibits a broad spectrum of cross-lingual similarity, ranging from near-translations to loosely related articles, making it valuable for various NLP applications and broad linguistically motivated studies. We publicly release the dataset in document- and sentence-aligned versions and code for the described experiments.

computational linguistic, machine learning, natural language, (19 more...)

2504.21677

Country:

Europe > Switzerland (0.68)
Asia > Middle East > UAE (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Media > News (0.86)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Robust Misinformation Detection by Visiting Potential Commonsense Conflict

Wang, Bing, Li, Ximing, Li, Changchun, Zhao, Bingrui, Fu, Bo, Guan, Renchu, Wang, Shengsheng

The development of Internet technology has led to an increased prevalence of misinformation, causing severe negative effects across diverse domains. To mitigate this challenge, Misinformation Detection (MD), aiming to detect online misinformation automatically, emerges as a rapidly growing research topic in the community. In this paper, we propose a novel plug-and-play augmentation method for the MD task, namely Misinformation Detection with Potential Commonsense Conflict (MD-PCC). We take inspiration from the prior studies indicating that fake articles are more likely to involve commonsense conflict. Accordingly, we construct commonsense expressions for articles, serving to express potential commonsense conflicts inferred by the difference between extracted commonsense triplet and golden ones inferred by the well-established commonsense reasoning tool COMET. These expressions are then specified for each article as augmentation. Any specific MD methods can be then trained on those commonsense-augmented articles. Besides, we also collect a novel commonsense-oriented dataset named CoMis, whose all fake articles are caused by commonsense conflict. We integrate MD-PCC with various existing MD backbones and compare them across both 4 public benchmark datasets and CoMis. Empirical results demonstrate that MD-PCC can consistently outperform the existing MD baselines.

d-pcc, knowledge management, machine learning, (19 more...)

2504.21604

Country: North America (0.46)

Genre: Research Report > New Finding (0.66)

Industry:

Media > News (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Knowledge Management (0.93)
(2 more...)

Detecting Manipulated Contents Using Knowledge-Grounded Inference

Meng, Mark Huasong, Wang, Ruizhe, Xu, Meng, Yan, Chuan, Bai, Guangdong

The detection of manipulated content, a prevalent form of fake news, has been widely studied in recent years. While existing solutions have been proven effective in fact-checking and analyzing fake news based on historical events, the reliance on either intrinsic knowledge obtained during training or manually curated context hinders them from tackling zero-day manipulated content, which can only be recognized with real-time contextual information. In this work, we propose Manicod, a tool designed for detecting zero-day manipulated content. Manicod first sources contextual information about the input claim from mainstream search engines, and subsequently vectorizes the context for the large language model (LLM) through retrieval-augmented generation (RAG). The LLM-based inference can produce a "truthful" or "manipulated" decision and offer a textual explanation for the decision. To validate the effectiveness of Manicod, we also propose a dataset comprising 4270 pieces of manipulated fake news derived from 2500 recent real-world news headlines. Manicod achieves an overall F1 score of 0.856 on this dataset and outperforms existing methods by up to 1.9x in F1 score on their benchmarks on fact-checking and claim verification.

large language model, machine learning, natural language, (20 more...)

2504.21165

Country:

North America > United States (1.00)
Asia (1.00)
Europe > United Kingdom (0.67)

Genre: Research Report > New Finding (0.46)

Industry:

Media > News (1.00)
Information Technology > Security & Privacy (1.00)
Government (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

ScienceApr-30-2025, 22:40:00 GMT

'Unethical' AI research on Reddit under fire

A study that used artificial intelligence–generated content to "participate" in online discussions and test whether AI was more successful at changing people's minds than human-generated content has caused an uproar because of ethical concerns about the work. This week some of the unwitting research participants publicly asked the University of Zürich (UZH), where the researchers behind the experiment hold positions, to investigate and apologize. "I think people have a reasonable expectation to not be in scientific experiments without their consent," says Casey Fiesler, an expert on internet research ethics at the University of Colorado Boulder. A university statement emailed to Science says the researchers--who remain anonymous--have decided not to publish their results. The university will investigate the incident, the statement says.

experiment, large language model, natural language, (19 more...)

Science

Country:

Europe > Switzerland > Zürich > Zürich (0.26)
North America > United States > Colorado > Boulder County > Boulder (0.25)
North America > United States > Massachusetts (0.05)
(3 more...)

Genre:

Research Report > Experimental Study (0.32)
Research Report > New Finding (0.30)

Industry:

Law (0.99)
Media > News (0.70)
Information Technology > Security & Privacy (0.49)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.34)

ScienceApr-30-2025, 21:00:00 GMT

Trump's team, often accused of spreading misinformation, slashes misinformation research

On 28 March, Briony Swire-Thompson began seeing reports online that the National Institutes of Health (NIH) might cancel grants for research on misinformation. At first, she didn't think she would be affected. Swire-Thompson, a psychologist at Northeastern University, studies misinformation--but not the political lies that get most of the attention. She's interested in false information about cancer, and why people fall for it. "There's a lot of people online trying to sell their snake oil," she says.

artificial intelligence, misinformation, misinformation research, (17 more...)

Science

Country:

Europe (0.15)
North America > United States > Wisconsin > Dane County > Madison (0.05)

Genre: Research Report (0.49)

Industry:

Media > News (1.00)
Health & Medicine (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence (0.49)

The GuardianApr-30-2025, 18:01:49 GMT

Ministers to amend data bill amid artists' concerns over AI and copyright

Artists including Paul McCartney and Tom Stoppard have thrown their weight behind a campaign against the changes in a series of high-level interventions. The government's commitments will be made in amendments to the data bill, which has become a vehicle for campaigners against the changes and is due to return to the Commons on Wednesday next week. The move has already been dismissed by critics. Ed Newton-Rex, a the British composer and prominent campaigner against the government proposals, said there was a "ton of evidence" showing the mooted changes were "terrible for creators". He added: "We don't need an impact assessment to tell us this."

ai company, artificial intelligence, government, (15 more...)

The Guardian

Country:

Europe > United Kingdom (0.18)
Asia > Middle East > Saudi Arabia (0.05)

Industry:

Government > Regional Government (0.75)
Media > Music (0.51)

Technology: Information Technology > Artificial Intelligence (1.00)