Reviews: A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks

Neural Information Processing Systems 

The paper addresses the problem of detecting abnormal inputs to deep neural networks: out-of-distribution samples, adversarial examples, and samples from new classes (for class-incremental learning). To achieve this, the authors fit class-conditional Gaussian distributions with a tied covariance (as in linear discriminant analysis) to the features at various layers of a target neural network, thereby modeling the distribution of in-distribution (inlier) inputs. They use the Mahalanobis distance to the closest class-conditional Gaussian as a confidence score; its negative is proportional, up to constants, to the log-likelihood under that Gaussian. They further enhance the score by taking a Fast Gradient Sign Method-style step in the input space that increases the confidence score. Finally, they combine the scores gathered at different layers of the network through a weighted linear combination, whose weights are learned with a logistic regression detector on validation samples.
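To make the summary concrete, below is a minimal PyTorch sketch of the pipeline as I understand it: fitting the tied-covariance class-conditional Gaussians, scoring with the Mahalanobis distance, the FGSM-style input pre-processing, and the per-layer score combination. All function names (`fit_class_gaussians`, `mahalanobis_confidence`, `preprocess`, `combine_layer_scores`) and hyperparameter values are illustrative assumptions of mine, not the authors' code.

```python
# Minimal sketch of the reviewed method; names and hyperparameters are
# illustrative assumptions, not the authors' implementation.
import torch

def fit_class_gaussians(features, labels, num_classes):
    """Fit class-conditional Gaussians sharing one (tied) covariance, as in LDA."""
    means = torch.stack([features[labels == c].mean(dim=0)
                         for c in range(num_classes)])
    centered = features - means[labels]                # subtract each class mean
    cov = centered.t() @ centered / features.shape[0]  # tied covariance estimate
    precision = torch.linalg.pinv(cov)                 # (pseudo-)inverse for scoring
    return means, precision

def mahalanobis_confidence(feats, means, precision):
    """Negative Mahalanobis distance to the closest class mean; up to constants,
    proportional to the class-conditional Gaussian log-likelihood."""
    diffs = feats.unsqueeze(1) - means.unsqueeze(0)    # (batch, class, dim)
    sq_dists = torch.einsum('bcd,de,bce->bc', diffs, precision, diffs)
    return -sq_dists.min(dim=1).values

def preprocess(x, feature_fn, means, precision, eps=0.002):
    """FGSM-style step in input space that *increases* the confidence score."""
    x = x.clone().requires_grad_(True)
    score = mahalanobis_confidence(feature_fn(x), means, precision).sum()
    grad, = torch.autograd.grad(score, x)
    return (x + eps * grad.sign()).detach()

def combine_layer_scores(layer_scores, weights):
    """Final detector: linear combination of per-layer confidence scores."""
    return sum(w * s for w, s in zip(weights, layer_scores))
```

At test time, each layer's score would be computed on the pre-processed input and the weighted scores summed; per the paper, the layer weights are learned by fitting a logistic regression detector on validation samples.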