Private Transformer Inference in MLaaS: A Survey
Li, Yang, Zhou, Xinyu, Wang, Yitong, Qian, Liangxin, Zhao, Jun
–arXiv.org Artificial Intelligence
Transformer models have revolutionized AI, powering applications like content generation and sentiment analysis. However, their deployment in Machine Learning as a Service (MLaaS) raises significant privacy concerns, primarily due to the centralized processing of sensitive user data. Private Transformer Inference (PTI) offers a solution by utilizing cryptographic techniques such as secure multi-party computation and homomorphic encryption, enabling inference while preserving both user data and model privacy. This paper reviews recent PTI advancements, highlighting state-of-the-art solutions and challenges. We also introduce a structured taxonomy and evaluation framework for PTI, focusing on balancing resource efficiency with privacy and bridging the gap between high-performance inference and data privacy.
arXiv.org Artificial Intelligence
May-16-2025
- Genre:
- Overview (1.00)
- Research Report (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: