AITopics | perturbing


perturbing

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question "What is Artificial Intelligence?" and you only have a minute, then here is the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

d00904cebc0d5b69fada8ad33d0f1422-Supplemental-Conference.pdf

Neural Information Processing Systems, May-1-2026, 04:46:51 GMT

artificial intelligence, machine learning, pixel budget, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Vision (0.49)
Information Technology > Artificial Intelligence > Machine Learning (0.34)


Strong and Precise Modulation of Human Percepts via Robustified ANNs: Supplementary Material (Pixel budget regimes)

Neural Information Processing Systems, Feb-17-2026, 05:39:22 GMT

Subject screening: To gain entry into the study, subjects were required to first perform a "demo" task consisting of 100 [...] We refer to measures of human choice probability that are lapse-rate corrected in this manner as "Normalized" (e.g., Supp. [...]). The typically observed lapse rates were quite low (median over subjects: 0%; mean 4.9%), indicating [...] Figure 3: Human disruption rates are largely stable across stimulus presentation times. At shorter viewing times, we observed modest or no increases in disruption rate. Source images were captured with a smartphone camera. [...] ImageNet classes, as previously defined in robustness library [2].

artificial intelligence, machine learning, pixel budget, (13 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Vision (0.49)
Information Technology > Artificial Intelligence > Machine Learning (0.33)


Perturbing Across the Feature Hierarchy to Improve Standard and Strict Blackbox Attack Transferability

Neural Information Processing Systems, Dec-24-2025, 20:52:33 GMT

We consider the blackbox transfer-based targeted adversarial attack threat model in the realm of deep neural network (DNN) image classifiers. Rather than focusing on crossing decision boundaries at the output layer of the source model, our method perturbs representations throughout the extracted feature hierarchy to resemble other classes. We design a flexible attack framework that allows for multi-layer perturbations and demonstrates state-of-the-art targeted transfer performance between ImageNet DNNs. We also show the superiority of our feature space methods under a relaxation of the common assumption that the source and target models are trained on the same dataset and label space, in some instances achieving a $10\times$ increase in targeted success rate relative to other blackbox transfer methods. Finally, we analyze why the proposed methods outperform existing attack strategies and show an extension of the method in the case when limited queries to the blackbox model are allowed.
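The abstract reports its gains as targeted success rate. As a minimal sketch, assuming the usual definition (the fraction of adversarial examples that the blackbox model assigns to the attacker's chosen target class), the metric and a relative-increase comparison could be computed as follows. The function name and all label values are hypothetical toy data, not results from the paper.

```python
def targeted_success_rate(predictions, targets):
    """Fraction of adversarial examples classified as their target class.

    predictions: labels the blackbox model assigns to the adversarial inputs.
    targets: labels the attacker wanted (assumed definition of the metric).
    """
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have equal length")
    if not predictions:
        raise ValueError("need at least one example")
    hits = sum(p == t for p, t in zip(predictions, targets))
    return hits / len(predictions)


# Toy labels, invented purely for illustration.
targets = [3, 3, 7, 7]
preds_method_a = [3, 3, 7, 1]  # hypothetical attack A: 3 of 4 hit the target
preds_method_b = [3, 0, 2, 1]  # hypothetical attack B: 1 of 4 hits the target

rate_a = targeted_success_rate(preds_method_a, targets)
rate_b = targeted_success_rate(preds_method_b, targets)
print(rate_a / rate_b)  # relative increase of A over B: 3.0
```

A ratio of this kind is what a statement such as "a $10\times$ increase in targeted success rate" compares; it is undefined when the baseline rate is zero, so a real evaluation would need to handle that case.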

feature hierarchy, name change, strict blackbox attack transferability, (4 more...)

Neural Information Processing Systems

Industry:

Information Technology > Security & Privacy (0.61)
Government > Military (0.61)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.98)


RAG-Pull: Imperceptible Attacks on RAG Systems for Code Generation

Stambolic, Vasilije, Dhar, Aritra, Cavigelli, Lukas

arXiv.org Artificial Intelligence, Oct-14-2025

Retrieval-Augmented Generation (RAG) increases the reliability and trustworthiness of the LLM response and reduces hallucination by eliminating the need for model retraining. It does so by adding external data into the LLM's context. We develop a new class of black-box attack, RAG-Pull, that inserts hidden UTF characters into queries or external code repositories, redirecting retrieval toward malicious code, thereby breaking the models' safety alignment. We observe that query and code perturbations alone can shift retrieval toward attacker-controlled snippets, while combined query-and-target perturbations achieve near-perfect success. Once retrieved, these snippets introduce exploitable vulnerabilities such as remote code execution and SQL injection. RAG-Pull's minimal perturbations can alter the model's safety alignment and increase preference towards unsafe code, therefore opening up a new class of attacks on LLMs.
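The abstract does not say which hidden characters RAG-Pull uses. The sketch below assumes zero-width Unicode format characters (general category Cf) purely for illustration, and shows the defensive side: why a perturbed query differs from the one a reader sees, and how such characters might be flagged or stripped before retrieval. The helper names `find_hidden_chars` and `strip_hidden_chars` are hypothetical.

```python
import unicodedata


def find_hidden_chars(text):
    """Return (index, character name) for every Unicode format (Cf) character."""
    return [
        (i, unicodedata.name(ch, f"U+{ord(ch):04X}"))
        for i, ch in enumerate(text)
        if unicodedata.category(ch) == "Cf"
    ]


def strip_hidden_chars(text):
    """Drop all Cf characters; an assumed, deliberately blunt sanitizer."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Cf")


clean = "parse json safely"
# Same visible text, with a zero-width space and a zero-width joiner inserted.
perturbed = "parse\u200b json\u200d safely"

print(perturbed == clean)                      # False: the strings differ
print(find_hidden_chars(perturbed))            # positions and names of hidden characters
print(strip_hidden_chars(perturbed) == clean)  # True after sanitizing
```

Stripping every Cf character is a blunt choice: it also removes legitimate uses such as joiners inside emoji sequences, so a production filter would likely need an allow-list. Whether this kind of filtering stops the attack described in the paper is not something the abstract states.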

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

arXiv:2510.11195

Country: Europe (0.28)

Genre: Research Report > New Finding (0.93)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)


Review for NeurIPS paper: Perturbing Across the Feature Hierarchy to Improve Standard and Strict Blackbox Attack Transferability

Neural Information Processing Systems, Feb-7-2025, 23:18:01 GMT

Weaknesses: - The first major concern is the limited methodological contribution compared to FDA. The proposed method just aggregates (i.e., sums) the FDA objectives of multiple layers and adds a cross-entropy term like other attack methods; in other words, these approaches are straightforward. Although the improvements of the proposed method are meaningful, they are not surprising or interesting results. TMIM/SGM methods do not use the training data for the white-box model, while FDA-based frameworks use the data for training auxiliary functions g. In my opinion, access to only pre-trained white-box models largely differs from access to the whole training data, and thus the latter uses more knowledge than the former.
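The aggregation the reviewer describes, a sum of per-layer objectives plus a cross-entropy term, can be sketched in a few lines. This is an illustration of the reviewer's summary only, not the authors' implementation: the per-layer values are treated as precomputed losses to minimize, and the `ce_weight` parameter and both function names are assumptions introduced here.

```python
import math


def cross_entropy(logits, target):
    """Cross-entropy of a softmax over `logits` against class index `target`."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target]


def combined_objective(layer_losses, logits, target, ce_weight=1.0):
    """Sum of per-layer feature-space losses plus a weighted cross-entropy term.

    layer_losses: one scalar per attacked layer (assumed precomputed).
    ce_weight: hypothetical weighting; the review does not specify one.
    """
    return sum(layer_losses) + ce_weight * cross_entropy(logits, target)


# Toy values, invented for illustration.
layer_losses = [0.5, 0.25]   # e.g. losses from two intermediate layers
logits = [0.0, 0.0]          # source-model outputs for a two-class toy problem
print(combined_objective(layer_losses, logits, target=1))
```

With two equal logits the cross-entropy term equals ln 2, so the printed value is 0.75 plus ln 2. In an actual attack the perturbation would be updated by gradient steps on such an objective, which this sketch does not attempt.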

feature hierarchy, strict blackbox attack transferability, training data, (2 more...)

Neural Information Processing Systems

Industry: Government (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.69)


Perturbing Across the Feature Hierarchy to Improve Standard and Strict Blackbox Attack Transferability

Neural Information Processing Systems, Jan-14-2025, 15:57:56 GMT

We consider the blackbox transfer-based targeted adversarial attack threat model in the realm of deep neural network (DNN) image classifiers. Rather than focusing on crossing decision boundaries at the output layer of the source model, our method perturbs representations throughout the extracted feature hierarchy to resemble other classes. We design a flexible attack framework that allows for multi-layer perturbations and demonstrates state-of-the-art targeted transfer performance between ImageNet DNNs. We also show the superiority of our feature space methods under a relaxation of the common assumption that the source and target models are trained on the same dataset and label space, in some instances achieving a $10\times$ increase in targeted success rate relative to other blackbox transfer methods. Finally, we analyze why the proposed methods outperform existing attack strategies and show an extension of the method in the case when limited queries to the blackbox model are allowed.

feature hierarchy, perturbing, strict blackbox attack transferability, (1 more...)

Neural Information Processing Systems

Industry:

Information Technology > Security & Privacy (0.65)
Government > Military (0.65)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
