Goto

Collaborating Authors

 ebpf


eACGM: Non-instrumented Performance Tracing and Anomaly Detection towards Machine Learning Systems

Xu, Ruilin, Xie, Zongxuan, Chen, Pengfei

arXiv.org Artificial Intelligence

--We present eACGM, a full-stack AI/ML system monitoring framework based on eBPF . Additionally, it leverages libnvml to gather process-level GPU resource usage information. By applying a Gaussian Mixture Model (GMM) to the collected multidimensional performance metrics for statistical modeling and clustering analysis, eACGM effectively identifies complex failure modes, such as latency anomalies, hardware failures, and communication inefficiencies, enabling rapid diagnosis of system bottlenecks and abnormal behaviors. T o evaluate eACGM's effectiveness and practicality, we conducted extensive empirical studies and case analyses in multi-node distributed training scenarios. The results demonstrate that eACGM, while maintaining a non-intrusive and low-overhead profile, successfully captures critical performance anomalies during model training and inference.


Ransomware Detection Using Machine Learning in the Linux Kernel

Brodzik, Adrian, Malec-Kruszyński, Tomasz, Niewolski, Wojciech, Tkaczyk, Mikołaj, Bocianiak, Krzysztof, Loui, Sok-Yen

arXiv.org Artificial Intelligence

Linux-based cloud environments have become lucrative targets for ransomware attacks, employing various encryption schemes at unprecedented speeds. Addressing the urgency for real-time ransomware protection, we propose leveraging the extended Berkeley Packet Filter (eBPF) to collect system call information regarding active processes and infer about the data directly at the kernel level. In this study, we implement two Machine Learning (ML) models in eBPF - a decision tree and a multilayer perceptron. Benchmarking latency and accuracy against their user space counterparts, our findings underscore the efficacy of this approach.


Leveraging eBPF and AI for Ransomware Nose Out

Sekar, Arjun, Kulkarni, Sameer G., Kuri, Joy

arXiv.org Artificial Intelligence

In this work, we propose a two-phased approach for real-time detection and deterrence of ransomware. To achieve this, we leverage the capabilities of eBPF (Extended Berkeley Packet Filter) and artificial intelligence to develop both proactive and reactive methods. In the first phase, we utilize signature based detection, where we employ custom eBPF programs to trace the execution of new processes and perform hash-based analysis against a known ransomware dataset. In the second, we employ a behavior-based technique that focuses on monitoring the process activities using a custom eBPF program and the creation of ransom notes, a prominent indicator of ransomware activity through the use of Natural Language Processing (NLP). By leveraging low-level tracing capabilities of eBPF and integrating NLP based machine learning algorithms, our solution achieves an impressive 99.76% accuracy in identifying ransomware incidents within a few seconds on the onset of zero-day attacks.