Utilizing Network Properties to Detect Erroneous Inputs

Mar-24-2023–arXiv.org Artificial Intelligence

Neural networks are vulnerable to a wide range of erroneous inputs such as adversarial, corrupted, out-of-distribution, and misclassified examples. In this work, we train a linear SVM classifier to detect these four types of erroneous data using hidden and softmax feature vectors of pre-trained neural networks. Our results indicate that these faulty data types generally exhibit activation properties that are linearly separable from those of correct examples, giving us the ability to reject bad inputs with no extra training or overhead. We experimentally validate our findings across a diverse range of datasets, domains, pre-trained models, and adversarial attacks.
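
The detection idea in the abstract can be illustrated with a small, self-contained sketch. It is not the authors' implementation: the feature vectors here are synthetic stand-ins for the hidden and softmax activations of a pre-trained network (deliberately generated to be well separated), and the linear SVM is a hand-written hinge-loss SGD trainer. All function names, dimensions, and hyperparameters are assumptions made for illustration only.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def synthetic_features(rng, erroneous):
    """Hypothetical stand-in for a network's hidden + softmax activations.

    Assumption: erroneous inputs shift the hidden activations and flatten
    the softmax output. Real features would be read from a pre-trained model.
    """
    center = -2.0 if erroneous else 2.0
    hidden = [rng.gauss(center, 0.5) for _ in range(4)]
    sharpness = 0.5 if erroneous else 4.0
    probs = softmax([sharpness * rng.gauss(0.0, 1.0) for _ in range(3)])
    return hidden + probs  # concatenated feature vector


def make_dataset(rng, n):
    """Label correct inputs +1 and erroneous inputs -1."""
    data = []
    for _ in range(n):
        erroneous = rng.random() < 0.5
        data.append((synthetic_features(rng, erroneous), -1 if erroneous else 1))
    return data


def train_linear_svm(data, epochs=10, lr=0.01, lam=0.001, seed=0):
    """SGD on the L2-regularized hinge loss (a simple linear SVM trainer)."""
    rng = random.Random(seed)
    w = [0.0] * len(data[0][0])
    b = 0.0
    data = list(data)
    for _ in range(epochs):
        rng.shuffle(data)
        for x, y in data:
            margin = y * (sum(wj * xj for wj, xj in zip(w, x)) + b)
            # Weight decay from the L2 regularization term.
            w = [wj * (1.0 - lr * lam) for wj in w]
            # Hinge-loss subgradient step only when the margin is violated.
            if margin < 1.0:
                w = [wj + lr * y * xj for wj, xj in zip(w, x)]
                b += lr * y
    return w, b


def predict(w, b, x):
    """Return +1 (accept as correct) or -1 (reject as erroneous)."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0.0 else -1


def accuracy(w, b, data):
    return sum(predict(w, b, x) == y for x, y in data) / len(data)


rng = random.Random(42)
train, test = make_dataset(rng, 200), make_dataset(rng, 200)
w, b = train_linear_svm(train)
test_accuracy = accuracy(w, b, test)
print(f"held-out detection accuracy: {test_accuracy:.2f}")
```

In this sketch the underlying network is never modified: only the linear detector is fit on top of fixed feature vectors, and an input is rejected when `predict` returns -1. With real models, the synthetic generator would be replaced by activations extracted from the network under test.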

artificial intelligence, deep learning, machine learning

Genre:
- Research Report > New Finding (1.00)

Industry:
- Information Technology > Security & Privacy (0.35)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning
    - Neural Networks > Deep Learning (0.69)
    - Performance Analysis > Accuracy (0.96)
    - Statistical Learning > Support Vector Machines (1.00)
  - Data Science (1.00)
