AITopics | trojan

trojan

Scanning Trojaned Models Using Out-of-Distribution Samples

Neural Information Processing Systems | Mar-22-2026, 20:11:43 GMT

Scanning for trojans (backdoors) in deep neural networks is crucial due to their significant real-world applications. There has been an increasing focus on developing effective general trojan scanning methods across various trojan attacks. Despite advancements, there remains a shortage of methods that perform effectively without preconceived assumptions about the backdoor attack method. Additionally, we have observed that current methods struggle to identify classifiers trojaned using adversarial training. Motivated by these challenges, our study introduces a novel scanning method named TRODO (TROjan scanning by Detection of adversarial shifts in Out-of-distribution samples).

artificial intelligence, machine learning, proceedings, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.59)

Red Teaming Deep Neural Networks with Feature Synthesis Tools

Neural Information Processing Systems | Feb-18-2026, 04:02:27 GMT

We argue that this is due, in part, to a common feature of many interpretability methods: they analyze model behavior by using a particular dataset.

artificial intelligence, machine learning, visualization, (16 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre:

Research Report (1.00)
Questionnaire & Opinion Survey (0.68)

Industry:

Information Technology > Security & Privacy (0.46)
Transportation > Ground > Road (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Training with More Confidence: Mitigating Injected and Natural Backdoors During Training

Neural Information Processing Systems | Feb-12-2026, 16:22:44 GMT

Researchers find that DNNs trained on benign data and settings can also learn backdoor behaviors, which is known as the natural backdoor.

artificial intelligence, machine learning, trojan, (18 more...)

Neural Information Processing Systems

Country: Asia > Nepal (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Information Technology > Security & Privacy (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

3f9bf45ea04c98ad7cb857f951f499e2-Supplemental-Conference.pdf

Neural Information Processing Systems | Feb-8-2026, 12:45:43 GMT

dataset, target label, trojan, (16 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Rethinking the Reverse-engineering of Trojan Triggers

Neural Information Processing Systems | Feb-8-2026, 12:45:39 GMT

Deep Neural Networks are vulnerable to Trojan (or backdoor) attacks.

artificial intelligence, machine learning, trojan, (20 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

3f9bbf77fbd858e5b6e39d39fe84ed2e-Supplemental-Conference.pdf

Neural Information Processing Systems | Feb-8-2026, 12:45:25 GMT

Denote the complexity of one forward and backward pass in the feature extractor as a_e, and that in the classifier as a_c.

artificial intelligence, exp, machine learning, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.70)

Red Teaming Deep Neural Networks with Feature Synthesis Tools

Neural Information Processing Systems | Dec-27-2025, 07:29:44 GMT

Interpretable AI tools are often motivated by the goal of understanding model behavior in out-of-distribution (OOD) contexts. Despite the attention this area of study receives, there are comparatively few cases where these tools have identified previously unknown bugs in models. We argue that this is due, in part, to a common feature of many interpretability methods: they analyze model behavior by using a particular dataset. This only allows for the study of the model in the context of features that the user can sample in advance. To address this, a growing body of research involves interpreting models using feature synthesis methods that do not depend on a dataset.

feature synthesis tool, name change, red teaming deep neural network, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.35)

Automated Hardware Trojan Insertion in Industrial-Scale Designs

Popryho, Yaroslav, Pal, Debjit, Partin-Vaisband, Inna

arXiv.org Artificial Intelligence | Nov-13-2025

Abstract: Industrial Systems-on-Chips (SoCs) often comprise hundreds of thousands to millions of nets and millions to tens of millions of connectivity edges, making empirical evaluation of hardware-Trojan (HT) detectors on realistic designs both necessary and difficult. Public benchmarks remain significantly smaller and hand-crafted, while releasing truly malicious RTL raises ethical and operational risks. This work presents an automated and scalable methodology for generating HT-like patterns in industry-scale netlists whose purpose is to stress-test detection tools without altering user-visible functionality. The pipeline (i) parses large gate-level designs into connectivity graphs, (ii) explores rare regions using SCOAP testability metrics, and (iii) applies parameterized, function-preserving graph transformations to synthesize trigger-payload pairs that mimic the statistical footprint of stealthy HTs. When evaluated on the benchmarks generated in this work, representative state-of-the-art graph-learning models fail to detect Trojans. The framework closes the evaluation gap between academic circuits and modern SoCs by providing reproducible challenge instances that advance security research without sharing step-by-step attack instructions.
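Step (ii) of the pipeline relies on SCOAP testability metrics. As a rough illustration only, and not the authors' implementation, the sketch below computes the standard SCOAP combinational controllability values (CC0, CC1) for a toy netlist and lists nets that are hard to set to some value. The netlist encoding, the gate set, the function names, and the threshold are all assumptions made for this example.

```python
from typing import Dict, List, Tuple

# Hypothetical toy encoding: output net -> (gate type, input nets).
# Gates are assumed to be listed in topological order.
Gates = Dict[str, Tuple[str, List[str]]]


def scoap_controllability(inputs: List[str], gates: Gates) -> Dict[str, Tuple[int, int]]:
    """Return {net: (CC0, CC1)} using the standard SCOAP combinational rules."""
    cc = {net: (1, 1) for net in inputs}  # primary inputs cost 1 to set to 0 or 1
    for out, (kind, ins) in gates.items():
        zeros = [cc[i][0] for i in ins]
        ones = [cc[i][1] for i in ins]
        if kind == "AND":
            # Output 0 needs any one input at 0; output 1 needs all inputs at 1.
            cc[out] = (min(zeros) + 1, sum(ones) + 1)
        elif kind == "OR":
            # Output 0 needs all inputs at 0; output 1 needs any one input at 1.
            cc[out] = (sum(zeros) + 1, min(ones) + 1)
        elif kind == "NOT":
            cc[out] = (ones[0] + 1, zeros[0] + 1)
        else:
            raise ValueError(f"unsupported gate type: {kind}")
    return cc


def hard_to_control(cc: Dict[str, Tuple[int, int]], threshold: int) -> List[str]:
    """Nets whose worse controllability value reaches the (assumed) threshold."""
    return sorted(net for net, (c0, c1) in cc.items() if max(c0, c1) >= threshold)


# Toy example: n1 = NOT(AND(AND(a, b), AND(c, d)))
example_gates: Gates = {
    "g1": ("AND", ["a", "b"]),
    "g2": ("AND", ["c", "d"]),
    "g3": ("AND", ["g1", "g2"]),
    "n1": ("NOT", ["g3"]),
}
cc = scoap_controllability(["a", "b", "c", "d"], example_gates)
print(cc["g3"], hard_to_control(cc, threshold=7))
```

Higher values mean a net is harder to drive to that logic level, which is one common way to approximate the "rare regions" the abstract mentions. A real flow would also need observability values and sequential metrics, which this sketch omits.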

artificial intelligence, detector, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2511.08703

Country:

North America > United States (0.46)
Asia > Japan (0.28)

Genre: Research Report (0.50)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)

Hammering the Diagnosis: Rowhammer-Induced Stealthy Trojan Attacks on ViT-Based Medical Imaging

Latibari, Banafsheh Saber, Nazari, Najmeh, Sayadi, Hossein, Homayoun, Houman, Mahalanobis, Abhijit

arXiv.org Artificial Intelligence | Oct-30-2025

Abstract: Vision Transformers (ViTs) have emerged as powerful architectures in medical image analysis, excelling in tasks such as disease detection, segmentation, and classification. However, their reliance on large, attention-driven models makes them vulnerable to hardware-level attacks. In this paper, we propose a novel threat model referred to as Med-Hammer that combines the Rowhammer hardware fault injection with neural Trojan attacks to compromise the integrity of ViT-based medical imaging systems. Specifically, we demonstrate how malicious bit flips induced via Rowhammer can trigger implanted neural Trojans, leading to targeted misclassification or suppression of critical diagnoses (e.g., tumors or lesions) in medical scans. Through extensive experiments on benchmark medical imaging datasets such as ISIC, Brain Tumor, and MedMNIST, we show that such attacks can remain stealthy while achieving high attack success rates of about 82.51% and 92.56% in MobileViT and SwinTransformer, respectively. We further investigate how architectural properties, such as model sparsity, attention weight distribution, and number of features of the layer, impact attack effectiveness. Our findings highlight a critical and underexplored intersection between hardware-level faults and deep learning security in healthcare applications, underscoring the urgent need for robust defenses spanning both model architectures and underlying hardware platforms. In clinical practice, medical imaging plays a central role in detecting, diagnosing, and monitoring a wide range of conditions.
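To see why a single memory bit flip matters for stored model weights, the following minimal sketch flips one bit of an IEEE 754 float32 value. It is not the Med-Hammer method and involves no hardware fault injection; the function name `flip_bit` and the example weight are assumptions made purely for illustration.

```python
import struct


def flip_bit(value: float, bit: int) -> float:
    """Reinterpret `value` as float32, flip one bit (0 = least significant), return the result."""
    if not 0 <= bit < 32:
        raise ValueError("bit index must be in [0, 32)")
    # Pack as little-endian float32, then read the same 4 bytes as an unsigned int.
    (raw,) = struct.unpack("<I", struct.pack("<f", value))
    # XOR toggles exactly one bit; convert the bytes back to a float.
    (flipped,) = struct.unpack("<f", struct.pack("<I", raw ^ (1 << bit)))
    return flipped


weight = 0.5  # hypothetical stored weight, bit pattern 0x3F000000
print(flip_bit(weight, 0))   # lowest mantissa bit: a negligible change
print(flip_bit(weight, 31))  # sign bit: the weight changes sign
print(flip_bit(weight, 30))  # top exponent bit: the weight becomes 2**127
```

The effect depends heavily on which bit is hit: a mantissa flip barely changes the value, while an exponent flip can change it by dozens of orders of magnitude, which is consistent with the abstract's point that a few targeted flips can change model behavior.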

accuracy, artificial intelligence, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2510.24976

Country: