LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures
Aguilera-Martínez, Francisco, Berzal, Fernando
–arXiv.org Artificial Intelligence
As large language models (LLMs) continue to evolve, it is critical to assess the security threats and vulnerabilities that may arise both during their training phase and after models have been deployed. This survey seeks to define and categorize the various attacks targeting LLMs, distinguishing between those that occur during the training phase and those that affect already trained models. A thorough analysis of these attacks is presented, alongside an exploration of defense mechanisms designed to mitigate such threats. Defenses are classified into two primary categories: prevention-based and detection-based defenses. Furthermore, our survey summarizes possible attacks and their corresponding defense strategies. It also provides an evaluation of the effectiveness of the known defense mechanisms for the different security threats. Our survey aims to offer a structured framework for securing LLMs, while also identifying areas that require further research to improve and strengthen defenses against emerging security challenges.
arXiv.org Artificial Intelligence
May-5-2025
- Country:
- Africa > Zambia
- Southern Province > Choma (0.04)
- Asia
- China > Xinjiang Uygur Autonomous Region (0.04)
- Japan > Honshū
- Chūbu > Toyama Prefecture > Toyama (0.04)
- Middle East
- Israel (0.04)
- Jordan (0.04)
- Saudi Arabia > Asir Province
- Abha (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Singapore > Central Region
- Singapore (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Europe
- Austria > Vienna (0.14)
- Monaco (0.04)
- Portugal (0.04)
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- Spain > Andalusia
- Granada Province > Granada (0.04)
- Switzerland > Zürich
- Zürich (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America > United States
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- California
- San Diego County > San Diego (0.04)
- Santa Clara County
- Mountain View (0.04)
- Palo Alto (0.04)
- Stanford (0.04)
- Colorado > Denver County
- Denver (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New Mexico > Socorro County
- Socorro (0.04)
- New York > New York County
- New York City (0.04)
- Texas > Travis County
- Austin (0.04)
- Alaska > Anchorage Municipality
- Africa > Zambia
- Genre:
- Overview (1.00)
- Research Report (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: