Robustness Reprogramming for Representation Learning

Hou, Zhichao, Torkamani, MohamadAli, Krim, Hamid, Liu, Xiaorui

Oct-6-2024–arXiv.org Machine Learning

This work tackles an intriguing and fundamental open challenge in representation learning: Given a well-trained deep learning model, can it be reprogrammed to enhance its robustness against adversarial or noisy input perturbations without altering its parameters? To explore this, we revisit the core feature transformation mechanism in representation learning and propose a novel non-linear robust pattern matching technique as a robust alternative. Furthermore, we introduce three model reprogramming paradigms to offer flexible control of robustness under different efficiency requirements. Comprehensive experiments and ablation studies across diverse learning models ranging from basic linear model and MLPs to shallow and modern deep ConvNets demonstrate the effectiveness of our approaches. This work not only opens a promising and orthogonal direction for improving adversarial defenses in deep learning beyond existing methods but also provides new insights into designing more resilient AI systems with robust statistics. Deep neural networks (DNNs) have made significant impacts across various domains due to their powerful capability of learning representation from high-dimensional data (LeCun et al., 2015; Goodfellow et al., 2016). However, it has been well-documented that DNNs are highly vulnerable to adversarial attacks (Szegedy, 2013; Biggio et al., 2013).

architecture, learning, robustness, (13 more...)

arXiv.org Machine Learning

Oct-6-2024

arXiv.org PDF

Add feedback

Country:
- Asia (0.04)
- North America > United States
  - North Carolina (0.04)
- Europe > Czechia
  - Prague (0.04)

Genre:
- Research Report (0.82)

Industry:
- Information Technology > Security & Privacy (0.66)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found