SoK: Taxonomy and Evaluation of Prompt Security in Large Language Models

Hanbin Hong, Shuya Feng, Nima Naderloui, Shenao Yan, Jingyu Zhang, Biying Liu, Ali Arastehfard, Heqing Huang, Yuan Hong

arXiv.org Artificial Intelligence 

Large Language Models (LLMs) have rapidly transitioned from academic research to core components of real-world applications, especially since the emergence of high-profile foundation models such as OpenAI's GPT series [17, 140], Google Gemini [9], Meta Llama [175, 176], Anthropic Claude [12], Alibaba Qwen [11, 210, 209], and Doubao [172]. Today, LLMs are deployed across an unprecedented range of sectors, from web search and code assistants to legal, educational, and healthcare domains, reaching hundreds of millions of end users globally. This rapid adoption has ushered in a new era of AI-powered services, but it also brings serious safety and security risks, ranging from misinformation and privacy leaks to adversarial attacks that exploit model vulnerabilities. In particular, a growing body of work shows that carefully crafted jailbreak prompts can bypass alignment constraints and induce models to produce sensitive, illegal, or harmful content. Alarmingly, recent studies report that such attacks achieve success rates exceeding 90% even on flagship models such as GPT-4, Claude 3, and DeepSeek-R1 [124, 42, 154, 118]. Because the outputs of these attacks can be exploited for malicious purposes, this threat demands close attention and effective mitigation.
