Privacy in large language models: Attacks, defenses and future directions

H Li, Y Chen, J Luo, J Wang, H Peng, Y Kang… - arxiv preprint arxiv …, 2023 - arxiv.org
The advancement of large language models (LLMs) has significantly enhanced the ability to
effectively tackle various downstream NLP tasks and unify these tasks into generative …

Sigma: Secure gpt inference with function secret sharing

K Gupta, N Jawalkar, A Mukherjee… - Cryptology ePrint …, 2023 - eprint.iacr.org
Abstract Secure 2-party computation (2PC) enables secure inference that offers protection
for both proprietary machine learning (ML) models and sensitive inputs to them. However …

Ciphergpt: Secure two-party gpt inference

X Hou, J Liu, J Li, Y Li, W Lu, C Hong… - Cryptology ePrint …, 2023 - eprint.iacr.org
ChatGPT is recognized as a significant revolution in the field of artificial intelligence, but it
raises serious concerns regarding user privacy, as the data submitted by users may contain …

Bumblebee: Secure two-party inference framework for large transformers

W Lu, Z Huang, Z Gu, J Li, J Liu, C Hong… - Cryptology ePrint …, 2023 - eprint.iacr.org
Large transformer-based models have realized state-of-the-art performance on lots of real-
world tasks such as natural language processing and computer vision. However, with the …

Secure transformer inference made non-interactive

J Zhang, X Yang, L He, K Chen, W Lu… - Cryptology ePrint …, 2024 - eprint.iacr.org
Secure transformer inference has emerged as a prominent research topic following the
proliferation of ChatGPT. Existing solutions are typically interactive, involving substantial …

Secformer: Towards fast and accurate privacy-preserving inference for large language models

J Luo, Y Zhang, Z Zhang, J Zhang, X Mu… - arxiv preprint arxiv …, 2024 - arxiv.org
With the growing use of large language models hosted on cloud platforms to offer inference
services, privacy concerns are escalating, especially concerning sensitive data like …

Panther: Practical Secure 2-Party Neural Network Inference

J Feng, Y Wu, H Sun, S Zhang… - IEEE Transactions on …, 2025 - ieeexplore.ieee.org
Secure two-party neural network (2P-NN) inference allows the server with a neural network
model and the client with inputs to perform neural network inference without revealing their …

Rhombus: Fast Homomorphic Matrix-Vector Multiplication for Secure Two-Party Inference

J He, K Yang, G Tang, Z Huang, L Lin, C Wei… - Proceedings of the …, 2024 - dl.acm.org
We present Rhombus, a new secure matrix-vector multiplication (MVM) protocol in the semi-
honest two-party setting, which is able to be seamlessly integrated into existing privacy …

Mpc-minimized secure llm inference

D Rathee, D Li, I Stoica, H Zhang, R Popa - arxiv preprint arxiv …, 2024 - arxiv.org
Many inference services based on large language models (LLMs) pose a privacy concern,
either revealing user prompts to the service or the proprietary weights to the user. Secure …

Privcirnet: Efficient private inference via block circulant transformation

T Xu, L Wu, R Wang, M Li - arxiv preprint arxiv:2405.14569, 2024 - arxiv.org
Homomorphic encryption (HE)-based deep neural network (DNN) inference protects data
and model privacy but suffers from significant computation overhead. We observe …