Influence-Guided Concolic Testing of Transformer Robustness
Hong, Chih-Duo, Wang, Yu, Chang, Yao-Chen, Yu, Fang
–arXiv.org Artificial Intelligence
Concolic testing for deep neural networks alternates concrete execution with constraint solving to search for inputs that flip decisions. We present an {influence-guided} concolic tester for Transformer classifiers that ranks path predicates by SHAP-based estimates of their impact on the model output. To enable SMT solving on modern architectures, we prototype a solver-compatible, pure-Python semantics for multi-head self-attention and introduce practical scheduling heuristics that temper constraint growth on deeper models. In a white-box study on compact Transformers under small $L_0$ budgets, influence guidance finds label-flip inputs more efficiently than a FIFO baseline and maintains steady progress on deeper networks. Aggregating successful attack instances with a SHAP-based critical decision path analysis reveals recurring, compact decision logic shared across attacks. These observations suggest that (i) influence signals provide a useful search bias for symbolic exploration, and (ii) solver-friendly attention semantics paired with lightweight scheduling make concolic testing feasible for contemporary Transformer models, offering potential utility for debugging and model auditing.
arXiv.org Artificial Intelligence
Sep-30-2025
- Country:
- Asia
- Taiwan (0.04)
- Vietnam > Hải Dương Province
- Hải Dương (0.04)
- Europe > Netherlands
- North Holland > Amsterdam (0.04)
- North America
- Canada (0.04)
- Mexico > Gulf of Mexico (0.04)
- United States
- Illinois > Cook County
- Chicago (0.04)
- New York (0.04)
- Illinois > Cook County
- Asia
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Information Technology (0.46)
- Technology: