LLMs in Software Security: A Survey of Vulnerability Detection Techniques and Insights
Sheng, Ze, Chen, Zhicheng, Gu, Shuning, Huang, Heqing, Gu, Guofei, Huang, Jeff
–arXiv.org Artificial Intelligence
Large Language Models (LLMs) are emerging as transformative tools for software vulnerability detection, addressing critical challenges in the security domain. Traditional methods, such as static and dynamic analysis, often falter due to inefficiencies, high false positive rates, and the growing complexity of modern software systems. By leveraging their ability to analyze code structures, identify patterns, and generate repair sugges- tions, LLMs, exemplified by models like GPT, BERT, and CodeBERT, present a novel and scalable approach to mitigating vulnerabilities. This paper provides a detailed survey of LLMs in vulnerability detection. It examines key aspects, including model architectures, application methods, target languages, fine-tuning strategies, datasets, and evaluation metrics. We also analyze the scope of current research problems, highlighting the strengths and weaknesses of existing approaches. Further, we address challenges such as cross-language vulnerability detection, multimodal data integration, and repository-level analysis. Based on these findings, we propose solutions for issues like dataset scalability, model interpretability, and applications in low-resource scenarios. Our contributions are threefold: (1) a systematic review of how LLMs are applied in vulnerability detection; (2) an analysis of shared patterns and differences across studies, with a unified framework for understanding the field; and (3) a summary of key challenges and future research directions. This work provides valuable insights for advancing LLM-based vulnerability detection. We also maintain and regularly update latest selected paper on https://github.com/OwenSanzas/LLM-For-Vulnerability-Detection
arXiv.org Artificial Intelligence
Feb-12-2025
- Country:
- South America > Colombia
- Meta Department > Villavicencio (0.04)
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- United States
- Washington > King County
- Seattle (0.04)
- Texas > Brazos County
- College Station (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- New York > New York County
- New York City (0.05)
- California
- San Francisco County > San Francisco (0.14)
- San Diego County > San Diego (0.04)
- Washington > King County
- Europe
- Austria > Vienna (0.14)
- Switzerland > Basel-City
- Basel (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Greece > Attica
- Athens (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Asia
- China > Hong Kong (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Singapore > Central Region
- Singapore (0.04)
- Middle East
- Jordan (0.04)
- Iran > Tehran Province
- Tehran (0.04)
- India > Telangana
- Hyderabad (0.04)
- South America > Colombia
- Genre:
- Overview (1.00)
- Research Report > New Finding (0.67)
- Industry:
- Technology: