Attackers Can Do Better: Over- and Understated Factors of Model Stealing Attacks

Oliynyk, Daryna, Mayer, Rudolf, Rauber, Andreas

Mar-8-2025–arXiv.org Artificial Intelligence

Machine learning models were shown to be vulnerable to model stealing attacks, which lead to intellectual property infringement. Among other methods, substitute model training is an all-encompassing attack applicable to any machine learning model whose behaviour can be approximated from input-output queries. Whereas prior works mainly focused on improving the performance of substitute models by, e.g. developing a new substitute training method, there have been only limited ablation studies on the impact the attacker's strength has on the substitute model's performance. As a result, different authors came to diverse, sometimes contradicting, conclusions. In this work, we exhaustively examine the ambivalent influence of different factors resulting from varying the attacker's capabilities and knowledge on a substitute training attack. Our findings suggest that some of the factors that have been considered important in the past are, in fact, not that influential; instead, we discover new correlations between attack conditions and success rate. In particular, we demonstrate that better-performing target models enable higher-fidelity attacks and explain the intuition behind this phenomenon. Further, we propose to shift the focus from the complexity of target models toward the complexity of their learning tasks. Therefore, for the substitute model, rather than aiming for a higher architecture complexity, we suggest focusing on getting data of higher complexity and an appropriate architecture. Finally, we demonstrate that even in the most limited data-free scenario, there is no need to overcompensate weak knowledge with millions of queries. Our results often exceed or match the performance of previous attacks that assume a stronger attacker, suggesting that these stronger attacks are likely endangering a model owner's intellectual property to a significantly higher degree than shown until now.

attacker, substitute model, target model, (15 more...)

arXiv.org Artificial Intelligence

Mar-8-2025

arXiv.org PDF

Add feedback

Country:
- South America > Brazil
  - Rio de Janeiro > Rio de Janeiro (0.04)
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America
  - United States
    - Texas > Travis County
      - Austin (0.04)
    - Nevada > Clark County
      - Las Vegas (0.04)
    - Massachusetts > Suffolk County
      - Boston (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Georgia > Fulton County
      - Atlanta (0.04)
    - Florida > Orange County
      - Orlando (0.04)
    - Colorado > Denver County
      - Denver (0.04)
    - California
      - San Francisco County > San Francisco (0.04)
      - San Diego County > San Diego (0.04)
      - Los Angeles County > Long Beach (0.04)
  - Canada
    - Ontario > Toronto (0.14)
    - Quebec > Montreal (0.04)
    - New Brunswick > York County
      - Fredericton (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.04)
- Europe
  - Austria > Vienna (0.14)
  - France (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Italy > Piedmont
    - Turin Province > Turin (0.04)
  - Hungary > Budapest
    - Budapest (0.04)
- Asia
  - Singapore > Central Region
    - Singapore (0.04)
  - Middle East
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.04)
    - Israel > Tel Aviv District
      - Tel Aviv (0.04)
  - China
    - Shaanxi Province > Xi'an (0.04)
    - Hong Kong (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Information Technology > Security & Privacy (1.00)
- Government (0.92)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found