On the Security Risks of ML-based Malware Detection Systems: A Survey
He, Ping, Mao, Yuhao, Li, Changjiang, Cavallaro, Lorenzo, Wang, Ting, Ji, Shouling
–arXiv.org Artificial Intelligence
Malware presents a persistent threat to user privacy and data integrity. To combat this, machine learning-based (ML-based) malware detection (MD) systems have been developed. However, these systems have increasingly been attacked in recent years, undermining their effectiveness in practice. While the security risks associated with ML-based MD systems have garnered considerable attention, the majority of prior works is limited to adversarial malware examples, lacking a comprehensive analysis of practical security risks. This paper addresses this gap by utilizing the CIA principles to define the scope of security risks. We then deconstruct ML-based MD systems into distinct operational stages, thus developing a stage-based taxonomy. Utilizing this taxonomy, we summarize the technical progress and discuss the gaps in the attack and defense proposals related to the ML-based MD systems within each stage. Subsequently, we conduct two case studies, using both inter-stage and intra-stage analyses according to the stage-based taxonomy to provide new empirical insights. Based on these analyses and insights, we suggest potential future directions from both inter-stage and intra-stage perspectives.
arXiv.org Artificial Intelligence
May-19-2025
- Country:
- Asia
- China > Zhejiang Province
- Hangzhou (0.04)
- Nepal (0.04)
- China > Zhejiang Province
- Europe
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- United Kingdom > England
- Greater London > London (0.04)
- Italy > Calabria
- North America > United States
- New York > Suffolk County > Stony Brook (0.04)
- Asia
- Genre:
- Overview (0.92)
- Research Report (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: