Paper 2024/1675

Testing Robustness of Homomorphically Encrypted Split Model LLMs

Lars Wolfgang Folkerts, University of Delaware
Nektarios Georgios Tsoutsos, University of Delaware
Abstract

Large language models (LLMs) have recently transformed many industries, enhancing content generation, customer service agents, data analysis and even software generation. These applications are often hosted on remote servers to protect the neural-network model IP; however, this raises concerns about the privacy of input queries. Fully Homomorphic Encryption (FHE), an encryption technique that allows for computations on private data, has been proposed as a solution to the challenge. Nevertheless, due to the increased size of LLMs and the computational overheads of FHE, today's practical FHE LLMs are implemented using a split model approach. Here, a user sends their FHE encrypted data to the server to run an encrypted attention head layer; then the server returns the result of the layer for the user to run the rest of the model locally. By employing this method, the server maintains part of their model IP, and the user still gets to perform private LLM inference. In this work, we evaluate the neural-network model IP protections of single layer split model LLMs, and demonstrate a novel attack vector that makes it easy for a user to extract the neural network model IP from the server, bypassing the claimed protections for encrypted computation. In our analysis, we demonstrate the feasibility of this attack, and discuss potential mitigations.

Metadata
Available format(s)
PDF
Category
Attacks and cryptanalysis
Publication info
Preprint.
Keywords
Large Language ModelsModel Extraction AttackFully Homomorphic EncryptionSplit-Model Architecture
Contact author(s)
folkerts @ udel edu
tsoutsos @ udel edu
History
2024-10-18: approved
2024-10-15: received
See all versions
Short URL
https://ia.cr/2024/1675
License
Creative Commons Attribution
CC BY

BibTeX

@misc{cryptoeprint:2024/1675,
      author = {Lars Wolfgang Folkerts and Nektarios Georgios Tsoutsos},
      title = {Testing Robustness of Homomorphically Encrypted Split Model {LLMs}},
      howpublished = {Cryptology {ePrint} Archive, Paper 2024/1675},
      year = {2024},
      url = {https://eprint.iacr.org/2024/1675}
}
Note: In order to protect the privacy of readers, eprint.iacr.org does not use cookies or embedded third party content.