Iron: Private inference on transformers

M Hao, H Li, H Chen, P **ng, G Xu… - Advances in neural …, 2022 - proceedings.neurips.cc
We initiate the study of private inference on Transformer-based models in the client-server
setting, where clients have private inputs and servers hold proprietary models. Our main …

Lingcn: Structural linearized graph convolutional network for homomorphically encrypted inference

H Peng, R Ran, Y Luo, J Zhao… - Advances in …, 2023 - proceedings.neurips.cc
Abstract The growth of Graph Convolution Network (GCN) model sizes has revolutionized
numerous applications, surpassing human performance in areas such as personal …

Autorep: Automatic relu replacement for fast private network inference

H Peng, S Huang, T Zhou, Y Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com
The growth of the Machine-Learning-As-A-Service (MLaaS) market has highlighted clients'
data privacy and security issues. Private inference (PI) techniques using cryptographic …

Sok: Cryptographic neural-network computation

LKL Ng, SSM Chow - 2023 IEEE Symposium on Security and …, 2023 - ieeexplore.ieee.org
We studied 53 privacy-preserving neural-network papers in 2016-2022 based on
cryptography (without trusted processors or differential privacy), 16 of which only use …

Adapi: Facilitating dnn model adaptivity for efficient private inference in edge computing

T Zhou, J Zhao, Y Luo, X **e, W Wen, C Ding… - arxiv preprint arxiv …, 2024 - arxiv.org
Private inference (PI) has emerged as a promising solution to execute computations on
encrypted data, safeguarding user privacy and model parameters in edge computing …

Secure transformer inference made non-interactive

J Zhang, X Yang, L He, K Chen, W Lu… - Cryptology ePrint …, 2024 - eprint.iacr.org
Secure transformer inference has emerged as a prominent research topic following the
proliferation of ChatGPT. Existing solutions are typically interactive, involving substantial …

Selective network linearization for efficient private inference

M Cho, A Joshi, B Reagen, S Garg… - … on Machine Learning, 2022 - proceedings.mlr.press
Private inference (PI) enables inferences directly on cryptographically secure data. While
promising to address many privacy issues, it has seen limited use due to extreme runtimes …

Scaling up trustless DNN inference with zero-knowledge proofs

D Kang, T Hashimoto, I Stoica, Y Sun - arxiv preprint arxiv:2210.08674, 2022 - arxiv.org
As ML models have increased in capabilities and accuracy, so has the complexity of their
deployments. Increasingly, ML model consumers are turning to service providers to serve …

Learning to linearize deep neural networks for secure and efficient private inference

S Kundu, S Lu, Y Zhang, J Liu, PA Beerel - arxiv preprint arxiv …, 2023 - arxiv.org
The large number of ReLU non-linearity operations in existing deep neural networks makes
them ill-suited for latency-efficient private inference (PI). Existing techniques to reduce ReLU …

Making models shallow again: Jointly learning to reduce non-linearity and depth for latency-efficient private inference

S Kundu, Y Zhang, D Chen… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Large number of ReLU and MAC operations of Deep neural networks make them ill-suited
for latency and compute-efficient private inference. In this paper, we present a model …