Android Malware Detection with Unbiased Confidence Guarantees
Papadopoulos, Harris, Georgiou, Nestoras, Eliades, Charalambos, Konstantinidis, Andreas
–arXiv.org Artificial Intelligence
The impressive growth of smartphone devices in combination with the rising ubiquity of using mobile platforms for sensitive applications such as Internet banking, have triggered a rapid increase in mobile malware. In recent literature, many studies examine Machine Learning techniques, as the most promising approach for mobile malware detection, without however quantifying the uncertainty involved in their detections. In this paper, we address this problem by proposing a machine learning dynamic analysis approach that provides provably valid confidence guarantees in each malware detection. Moreover the particular guarantees hold for both the malicious and benign classes independently and are unaffected by any bias in the data. The proposed approach is based on a novel machine learning framework, called Conformal Prediction, combined with a random forests classifier. We examine its performance on a large-scale dataset collected by installing 1866 malicious and 4816 benign applications on a real android device. We make this collection of dynamic analysis data available to the research community. The obtained experimental results demonstrate the empirical validity, usefulness and unbiased nature of the outputs produced by the proposed approach.
arXiv.org Artificial Intelligence
Dec-17-2023
- Country:
- North America > United States
- New York (0.04)
- Europe
- Austria > Vienna (0.14)
- Middle East > Cyprus (0.04)
- Greece > Central Macedonia
- Thessaloniki (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.88)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology:
- Information Technology
- Security & Privacy (1.00)
- Communications > Mobile (1.00)
- Artificial Intelligence > Machine Learning
- Inductive Learning (0.68)
- Statistical Learning (0.68)
- Performance Analysis > Accuracy (0.46)
- Information Technology