On the Security Risks of ML-based Malware Detection Systems: A Survey

He, Ping, Mao, Yuhao, Li, Changjiang, Cavallaro, Lorenzo, Wang, Ting, Ji, Shouling

May-19-2025–arXiv.org Artificial Intelligence

Malware presents a persistent threat to user privacy and data integrity. To combat this, machine learning-based (ML-based) malware detection (MD) systems have been developed. However, these systems have increasingly been attacked in recent years, undermining their effectiveness in practice. While the security risks associated with ML-based MD systems have garnered considerable attention, the majority of prior works is limited to adversarial malware examples, lacking a comprehensive analysis of practical security risks. This paper addresses this gap by utilizing the CIA principles to define the scope of security risks. We then deconstruct ML-based MD systems into distinct operational stages, thus developing a stage-based taxonomy. Utilizing this taxonomy, we summarize the technical progress and discuss the gaps in the attack and defense proposals related to the ML-based MD systems within each stage. Subsequently, we conduct two case studies, using both inter-stage and intra-stage analyses according to the stage-based taxonomy to provide new empirical insights. Based on these analyses and insights, we suggest potential future directions from both inter-stage and intra-stage perspectives.

artificial intelligence, machine learning, ml-based md system, (13 more...)

arXiv.org Artificial Intelligence

May-19-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York > Suffolk County > Stony Brook (0.04)
- Europe
  - United Kingdom > England
    - Greater London > London (0.04)
  - Switzerland > Zürich
    - Zürich (0.14)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
- Asia
  - Nepal (0.04)
  - China > Zhejiang Province
    - Hangzhou (0.04)

Genre:
- Research Report (1.00)
- Overview (0.92)

Industry:
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Artificial Intelligence > Machine Learning
    - Neural Networks > Deep Learning (1.00)
    - Performance Analysis > Accuracy (0.94)
    - Statistical Learning (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found