RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference
Peng, Hongwu, Zhou, Shanglin, Luo, Yukui, Xu, Nuo, Duan, Shijin, Ran, Ran, Zhao, Jiahui, Huang, Shaoyi, Xie, Xi, Wang, Chenghong, Geng, Tong, Wen, Wujie, Xu, Xiaolin, Ding, Caiwen
–arXiv.org Artificial Intelligence
In this work, we propose a novel approach, Machine-Learning-as-a-Service (MLaaS) has emerged as the ReLU-Reduced Neural Architecture Search a popular solution for accelerating inference in various applications (RRNet) framework, that jointly optimizes the structure of [1]-[11]. The challenges of MLaaS comes from the deep neural network (DNN) model and the hardware several folds: inference latency and privacy. To accelerate the architecture to support high-performance MPC-based PI. Our MLaaS training and inference application, accelerated gradient framework eliminates the need for manual heuristic analysis sparsification [12], [13] and model compression methods [14]- by automating the process of exploring the design space [22] are proposed. On the other side, a major limitation of and identifying the optimal configuration of DNN models MLaaS is the requirement for clients to reveal raw input data and hardware architectures for 2PC-based PI. We use FPGA to the service provider, which may compromise the privacy accelerator design as a demonstration and summarize our of users. This issue has been highlighted in previous studies contributions: such as [23]. In this work, we aim to address this challenge 1) We propose a novel approach to addressing the high by proposing a novel approach for privacy-preserving MLaaS.
arXiv.org Artificial Intelligence
Feb-22-2023
- Country:
- Europe (0.68)
- Genre:
- Research Report (1.00)
- Industry:
- Health & Medicine (0.69)
- Information Technology > Security & Privacy (0.68)
- Technology: