LLMs for Domain Generation Algorithm Detection
La O, Reynier Leyva, Catania, Carlos A., Parlanti, Tatiana
–arXiv.org Artificial Intelligence
We perform a detailed evaluation of two important techniques: In-Context Learning (ICL) and Supervised Fine-Tuning (SFT), showing how they can improve detection. SFT increases performance by using domain-specific data, whereas ICL helps the detection model to quickly adapt to new threats without requiring much retraining. We use Meta's Llama3 8B model, on a custom dataset with 68 malware families and normal domains, covering several hard-to-detect schemes, including recent word-based DGAs. Results proved that LLM-based methods can achieve competitive results in DGA detection. In particular, the SFT-based LLM DGA detector outperforms state-of-the-art models using attention layers, achieving 94% accuracy with a 4% false positive rate (FPR) and excelling at detecting word-based DGA domains.
arXiv.org Artificial Intelligence
Nov-5-2024
- Country:
- North America > United States
- New York (0.04)
- South America > Argentina
- Cuyo > Mendoza Province > Mendoza (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: