Non-Robust Features are Not Always Useful in One-Class Classification

Lau, Matthew, Wang, Haoran, Helbling, Alec, Hul, Matthew, Peng, ShengYun, Andreoni, Martin, Lunardi, Willian T., Lee, Wenke

Jul-8-2024–arXiv.org Artificial Intelligence

The robustness of machine learning models has been questioned by the existence of adversarial examples. We examine the threat of adversarial examples in practical applications that require lightweight models for one-class classification. Building on Ilyas et al. (2019), we investigate the vulnerability of lightweight one-class classifiers to adversarial attacks and possible reasons for it. Our results show that lightweight one-class classifiers learn features that are not robust (e.g. texture) under stronger attacks. However, unlike in multi-class classification (Ilyas et al., 2019), these non-robust features are not always useful for the one-class task, suggesting that learning these unpredictive and non-robust features is an unwanted consequence of training.

classification, dataset, nrf, (15 more...)

arXiv.org Artificial Intelligence

Jul-8-2024

arXiv.org PDF

Add feedback

Country:
- Europe > Switzerland (0.04)
- North America > United States
  - New York > New York County > New York City (0.04)

Genre:
- Research Report > New Finding (0.69)

Industry:
- Information Technology > Security & Privacy (0.49)
- Transportation (0.32)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks (0.69)
  - Statistical Learning (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found