AITopics | pgd attack

Adversarial training for free!

Ali Shafahi, Mahyar Najibi, Mohammad Amin Ghiasi, Zheng Xu, John Dickerson, Christoph Studer, Larry S. Davis, Gavin Taylor, Tom Goldstein

Neural Information Processing Systems, Feb-12-2026, 15:15:50 GMT

Neural Information Processing Systems http://nips.cc/

adversarial example, adversarial training, robustness, (14 more...)

Country:

North America > United States > Maryland (0.05)
North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)

Industry:

Government > Military (1.00)
Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

This allows us to give a convergence guarantee for the inner-loop PGD attack.

adversarial training, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Industry: Information Technology > Security & Privacy (0.95)

Technology:

Information Technology > Security & Privacy (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Adversarial training for free!

Ali Shafahi, Mahyar Najibi, Mohammad Amin Ghiasi, Zheng Xu, John Dickerson, Christoph Studer, Larry S. Davis, Gavin Taylor, Tom Goldstein

Neural Information Processing Systems, Oct-3-2025, 00:36:35 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, robustness, (17 more...)

Country: North America > United States (1.00)

Industry:

Government > Military (1.00)
Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

246a3c5544feb054f3ea718f61adfa16-AuthorFeedback.pdf

Neural Information Processing Systems, Oct-2-2025, 09:36:29 GMT

artificial intelligence, crown, fast-lin, (17 more...)

Technology: Information Technology > Artificial Intelligence (0.30)

DAC-LoRA: Dynamic Adversarial Curriculum for Efficient and Robust Few-Shot Adaptation

Umrajkar, Ved

arXiv.org Artificial Intelligence, Sep-26-2025

Vision-Language Models (VLMs) are foundational to critical applications like autonomous driving, medical diagnosis, and content moderation. While Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA enable their efficient adaptation to specialized tasks, these models remain vulnerable to adversarial attacks that can compromise safety-critical decisions. CLIP, the backbone for numerous downstream VLMs, is a high-value target whose vulnerabilities can cascade across the multimodal AI ecosystem. We propose Dynamic Adversarial Curriculum (DAC-LoRA), a novel framework that integrates adversarial training into PEFT. The core principle of our method, i.e., an intelligent curriculum of progressively challenging attacks, is general and can potentially be applied to any iterative attack method. Guided by the First-Order Stationary Condition (FOSC) and a TRADES-inspired loss, DAC-LoRA achieves substantial improvements in adversarial robustness without significantly compromising clean accuracy. Our work presents an effective, lightweight, and broadly applicable method, and demonstrates that the DAC-LoRA framework can be easily integrated into a standard PEFT pipeline to significantly enhance robustness.
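
For readers unfamiliar with the terms in this abstract, the following is a minimal sketch of an L-infinity PGD attack together with a FOSC-style convergence measure. It is not the DAC-LoRA implementation. The toy linear logistic model, the function names (`loss_and_grad`, `pgd_linf`, `fosc`), and all numeric values are assumptions made for illustration. The FOSC formula used here, eps * ||g||_1 - <x_adv - x0, g>, is the form commonly given for an L-infinity ball and is assumed rather than taken from this listing. A value near zero suggests the attack has converged, and a larger value suggests a weaker attack.

```python
import math

def loss_and_grad(w, b, x, y):
    """Logistic loss of a toy linear model and its gradient w.r.t. the input x.
    y is a label in {-1, +1}. The model is a stand-in, not a real VLM."""
    z = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
    loss = math.log1p(math.exp(-z))
    scale = -y / (1.0 + math.exp(z))
    return loss, [scale * wi for wi in w]

def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)

def pgd_linf(w, b, x0, y, eps, step, n_steps):
    """Signed-gradient ascent on the loss, projected back onto the
    L-infinity ball of radius eps around the clean input x0."""
    x = list(x0)
    for _ in range(n_steps):
        _, g = loss_and_grad(w, b, x, y)
        x = [min(max(xi + step * sign(gi), x0i - eps), x0i + eps)
             for xi, gi, x0i in zip(x, g, x0)]
    return x

def fosc(x_adv, x0, grad, eps):
    """Assumed FOSC form for an L-infinity ball: eps*||g||_1 - <x_adv - x0, g>."""
    l1 = sum(abs(g) for g in grad)
    inner = sum((xa - xo) * g for xa, xo, g in zip(x_adv, x0, grad))
    return eps * l1 - inner

if __name__ == "__main__":
    # Hypothetical toy values chosen only for the demonstration.
    w, b, x0, y, eps = [1.0, -2.0, 0.5], 0.1, [0.2, -0.1, 0.4], 1, 0.1
    for n_steps in (0, 1, 2, 5):
        x_adv = pgd_linf(w, b, x0, y, eps, step=0.05, n_steps=n_steps)
        loss, g = loss_and_grad(w, b, x_adv, y)
        print(n_steps, loss, fosc(x_adv, x0, g, eps))
```

In this sketch, more PGD steps raise the loss and lower the FOSC value. A curriculum of progressively stronger attacks could use such a measure to decide when to increase attack strength, though how DAC-LoRA does this is not stated in the abstract.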

curriculum, machine learning, natural language, (17 more...)

2509.20792

Genre:

Research Report (0.82)
Instructional Material > Course Syllabus & Notes (0.50)

Industry:

Information Technology (0.69)
Government > Military (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

[Re] Improving Interpretation Faithfulness for Vision Transformers

Kurek, Izabela, Trejter, Wojciech, Frkovic, Stipe, Erdelez, Andro

arXiv.org Artificial Intelligence, Sep-19-2025

This work aims to reproduce the results of Faithful Vision Transformers (FViTs) proposed by Hu et al. (2024) alongside interpretability methods for Vision Transformers from Chefer et al. (2021) and Xu et al. (2022). We investigate claims made by Hu et al. (2024), namely that the usage of Diffusion Denoised Smoothing (DDS) improves interpretability robustness to (1) attacks in a segmentation task and (2) perturbation and attacks in a classification task. We also extend the original study by investigating the authors' claims that adding DDS to any interpretability method can improve its robustness under attack. This is tested on baseline methods and the recently proposed Attribution Rollout method.

artificial intelligence, machine learning research, robustness, (16 more...)

2509.14846

Country: Europe > Switzerland (0.28)

Genre: Research Report > New Finding (0.93)

Industry: Information Technology (0.46)

Technology: