Adversarial Examples Exist in Two-Layer ReLU Networks for Low Dimensional Linear Subspaces

Apr-24-2026, 22:46:22 GMT–Neural Information Processing Systems

Despite a great deal of research, it is still not well understood why trained neural networks are highly vulnerable to adversarial examples. In this work, we focus on two-layer neural networks trained on data that lie on a low-dimensional linear subspace. We show that standard gradient methods lead to non-robust neural networks, namely, networks that have large gradients in directions orthogonal to the data subspace and are susceptible to small adversarial L2-perturbations in these directions. Moreover, we show that decreasing the initialization scale of the training algorithm, or adding L2 regularization, can make the trained network more robust to adversarial perturbations orthogonal to the data.
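
The mechanism described in the abstract can be illustrated with a small NumPy sketch. This is not the paper's experimental setup; it is a toy example under assumptions chosen for brevity: the data subspace is taken to be the first k coordinates, the loss is logistic, and the sizes d, k, m, n, the step size, and the labeling rule are all hypothetical. It relies on one fact that holds for any two-layer ReLU network trained by plain gradient descent without regularization: the gradient with respect to each first-layer weight vector is a linear combination of the training inputs, so the weight components orthogonal to the data subspace stay at their initial values and keep contributing to the input gradient.

```python
import numpy as np

rng = np.random.default_rng(0)
d, k, m, n = 20, 5, 64, 40  # ambient dim, subspace dim, width, samples (all assumed)

# Assumption: the data subspace is spanned by the first k coordinates.
X = np.zeros((n, d))
X[:, :k] = rng.normal(size=(n, k))
y = np.where(X[:, 0] > 0, 1.0, -1.0)  # hypothetical toy labels in {-1, +1}

# Two-layer ReLU network N(x) = sum_j u_j * relu(w_j . x + b_j).
W = rng.normal(size=(m, d)) / np.sqrt(d)
b = np.zeros(m)
u = rng.choice([-1.0, 1.0], size=m) / np.sqrt(m)
W_orth_init = W[:, k:].copy()  # weight components orthogonal to the data


def forward(X, W, b, u):
    pre = X @ W.T + b
    return pre, np.maximum(pre, 0.0) @ u


# Plain gradient descent on the logistic loss log(1 + exp(-y * N(x))).
lr = 0.1
for _ in range(500):
    pre, out = forward(X, W, b, u)
    g = -y / (1.0 + np.exp(y * out))           # dLoss/dN per sample
    delta = g[:, None] * (pre > 0) * u[None, :]  # dLoss/dpre, shape (n, m)
    grad_W = delta.T @ X / n                   # rows are combinations of the x_i
    grad_b = delta.mean(axis=0)
    grad_u = np.maximum(pre, 0.0).T @ g / n
    W -= lr * grad_W
    b -= lr * grad_b
    u -= lr * grad_u

# Input gradient of the trained network at a training point.
x0 = X[0]
pre0, out0 = forward(x0[None, :], W, b, u)
grad_x = (u * (pre0[0] > 0)) @ W
grad_in, grad_orth = grad_x[:k], grad_x[k:]

# An L2 perturbation of size eps that moves only orthogonally to the data.
eps = 1.0  # assumed perturbation budget
step = np.zeros(d)
step[k:] = grad_orth / np.linalg.norm(grad_orth)
x_adv = x0 - y[0] * eps * step
_, out_adv = forward(x_adv[None, :], W, b, u)

print("orthogonal weights unchanged:", np.allclose(W[:, k:], W_orth_init))
print("input-gradient norm inside / orthogonal to the subspace:",
      np.linalg.norm(grad_in), np.linalg.norm(grad_orth))
print("output at x0 and at the perturbed point:", out0[0], out_adv[0])
```

Rerunning the sketch with a smaller initialization scale for W, or with a weight-decay term added to grad_W, shrinks the orthogonal weight components and hence the orthogonal part of the input gradient, which is the direction of the two remedies the abstract mentions.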

artificial intelligence, machine learning, perturbation

Country:
- Asia (0.28)

Genre:
- Research Report (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
