Exploring Semantic-constrained Adversarial Example with Instruction Uncertainty Reduction
–Neural Information Processing Systems
Recently, semantically constrained adversarial examples (SemanticAE), which are directly generated from natural language instructions, have become a promising avenue for future research due to their flexible attacking forms, but have not been thoroughly explored yet. To generate SemanticAEs, current methods fall short of satisfactory attacking ability as the key underlying factors of semantic uncertainty in human instructions, such as referring diversity, descriptive incompleteness, and boundary ambiguity, have not been fully investigated. To tackle the issues, this paper develops a multi-dimensional instruction uncertainty reduction (InsUR) framework to generate more satisfactory SemanticAE, i.e., transferable, adaptive, and effective. Specifically, in the dimension of the sampling method, we propose the residual-driven attacking direction stabilization to alleviate the unstable adversarial optimization caused by the diversity of language references. By coarsely predicting the language-guided sampling process, the optimization process will be stabilized by the designed ResAdv-DDIM sampler, therefore releasing the transferable and robust adversarial capability of multi-step diffusion models.
Neural Information Processing Systems
Jun-23-2026, 09:58:13 GMT
- Country:
- Asia (0.67)
- Genre:
- Overview (1.00)
- Research Report
- Experimental Study (1.00)
- New Finding (0.92)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Government > Military (0.93)
- Technology:
- Information Technology
- Security & Privacy (1.00)
- Data Science (0.92)
- Artificial Intelligence
- Vision (1.00)
- Representation & Reasoning > Optimization (1.00)
- Natural Language (1.00)
- Machine Learning
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (0.67)
- Information Technology