A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment
Wang, Kun, Zhang, Guibin, Zhou, Zhenhong, Wu, Jiahao, Yu, Miao, Zhao, Shiqian, Yin, Chenlong, Fu, Jinhu, Yan, Yibo, Luo, Hanjun, Lin, Liang, Xu, Zhihao, Lu, Haolang, Cao, Xinye, Zhou, Xinyun, Jin, Weifei, Meng, Fanci, Xu, Shicheng, Mao, Junyuan, Wang, Yu, Wu, Hao, Wang, Minghe, Zhang, Fan, Fang, Junfeng, Qu, Wenjie, Liu, Yue, Liu, Chengwei, Zhang, Yifan, Li, Qiankun, Guo, Chongye, Qin, Yalan, Fan, Zhaoxin, Wang, Kai, Ding, Yi, Hong, Donghai, Ji, Jiaming, Lai, Yingxin, Yu, Zitong, Li, Xinfeng, Jiang, Yifan, Li, Yanhui, Deng, Xinyu, Wu, Junlin, Wang, Dongxia, Huang, Yihao, Guo, Yufei, Huang, Jen-tse, Wang, Qiufeng, Jin, Xiaolong, Wang, Wenxuan, Liu, Dongrui, Yue, Yanwei, Huang, Wenke, Wan, Guancheng, Chang, Heng, Li, Tianlin, Yu, Yi, Li, Chenghao, Li, Jiawei, Bai, Lei, Zhang, Jie, Guo, Qing, Wang, Jingyi, Chen, Tianlong, Zhou, Joey Tianyi, Jia, Xiaojun, Sun, Weisong, Wu, Cong, Chen, Jing, Hu, Xuming, Li, Yiming, Wang, Xiao, Zhang, Ningyu, Tuan, Luu Anh, Xu, Guowen, Zhang, Jiaheng, Zhang, Tianwei, Ma, Xingjun, Gu, Jindong, Pang, Liang, Wang, Xiang, An, Bo, Sun, Jun, Bansal, Mohit, Pan, Shirui, Lyu, Lingjuan, Elovici, Yuval, Kailkhura, Bhavya, Yang, Yaodong, Li, Hongwei, Xu, Wenyuan, Sun, Yizhou, Wang, Wei, Li, Qing, Tang, Ke, Jiang, Yu-Gang, Juefei-Xu, Felix, Xiong, Hui, Wang, Xiaofeng, Tao, Dacheng, Yu, Philip S., Wen, Qingsong, Liu, Yang
–arXiv.org Artificial Intelligence
The remarkable success of Large Language Models (LLMs) has illuminated a promising pathway toward achieving Artificial General Intelligence for both academic and industrial communities, owing to their unprecedented performance across various applications. As LLMs continue to gain prominence in both research and commercial domains, their security and safety implications have become a growing concern, not only for researchers and corporations but also for every nation. Currently, existing surveys on LLM safety primarily focus on specific stages of the LLM lifecycle, e.g., deployment phase or fine-tuning phase, lacking a comprehensive understanding of the entire "lifechain" of LLMs. To address this gap, this paper introduces, for the first time, the concept of "full-stack" safety to systematically consider safety issues throughout the entire process of LLM training, deployment, and eventual commercialization. Compared to the off-the-shelf LLM safety surveys, our work demonstrates several distinctive advantages: (I) Comprehensive Perspective. We define the complete LLM lifecycle as encompassing data preparation, pre-training, post-training, deployment and final commercialization. To our knowledge, this represents the first safety survey to encompass the entire lifecycle of LLMs. (II) Extensive Literature Support. Our research is grounded in an exhaustive review of over 800+ papers, ensuring comprehensive coverage and systematic organization of security issues within a more holistic understanding. (III) Unique Insights. Through systematic literature analysis, we have developed reliable roadmaps and perspectives for each chapter. Our work identifies promising research directions, including safety in data generation, alignment techniques, model editing, and LLM-based agent systems. These insights provide valuable guidance for researchers pursuing future work in this field.
arXiv.org Artificial Intelligence
Jun-10-2025
- Country:
- Asia
- China
- Guangdong Province > Guangzhou (0.04)
- Hong Kong (0.04)
- Hubei Province > Wuhan (0.04)
- Shanghai > Shanghai (0.04)
- Middle East > Jordan (0.04)
- Singapore (0.04)
- China
- Europe
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Latvia > Lubāna Municipality
- Lubāna (0.04)
- Norway (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Italy > Calabria
- North America
- Canada > Ontario
- National Capital Region > Ottawa (0.04)
- United States
- California
- Los Angeles County > Los Angeles (0.13)
- San Diego County > San Diego (0.04)
- Illinois > Cook County
- Chicago (0.04)
- North Carolina (0.04)
- Pennsylvania (0.04)
- California
- Canada > Ontario
- Asia
- Genre:
- Overview (1.00)
- Research Report
- Experimental Study (0.92)
- New Finding (1.00)
- Industry:
- Media (1.00)
- Government > Military (1.00)
- Banking & Finance (0.92)
- Law (1.00)
- Energy (0.92)
- Education > Educational Setting
- Online (0.47)
- Leisure & Entertainment (1.00)
- Information Technology > Security & Privacy (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.92)
- Health & Medicine > Health Care Technology
- Medical Record (0.45)
- Technology: