Paper 2024/883
Low-Latency Linear Transformations with Small Key Transmission for Private Neural Network on Homomorphic Encryption
Abstract
In the field of Artificial Intelligence (AI), convolution operations have primarily been used in Convolutional Neural Networks (CNNs). However, their utility is increasing with the appearance of convolution-integrated transformers and state space models in which convolution is a constituent element. In the field of private AI, a generalized algorithm, multiplexed parallel convolution, was recently proposed to implement CNNs on the residue number system variant of the Cheon-Kim-Kim-Song (RNS-CKKS) homomorphic encryption scheme. Multiplexed parallel convolution is widely applicable, but its use has been limited in part because it requires many rotation operations. In this paper, we propose rotation optimized convolution, which reduces the number of rotations required for multiplexed parallel convolution, thereby lowering latency, enhancing usability, and reducing the rotation keys required. We further reduce the size of the rotation keys by applying the hierarchical rotation key system together with our proposed small level key system. We also propose a new form of matrix-vector multiplication, Parallel Baby-Step Giant-Step matrix-vector multiplication, which likewise reduces the number of rotations. In our experiments, rotation optimized convolution achieved up to a 70% reduction in execution time and a 29× reduction in rotation keys. Our proposed matrix-vector multiplication method reduced execution time by up to 64%.
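To illustrate why a baby-step giant-step (BSGS) arrangement lowers the rotation count, here is a minimal plaintext sketch of the classic diagonal-method matrix-vector product and its standard BSGS variant. It is not the Parallel BSGS method proposed in the paper, whose details the abstract does not give. Python lists stand in for ciphertext slots, and all function names (`rotate`, `diagonal`, `matvec_naive`, `matvec_bsgs`) are hypothetical.

```python
def rotate(v, k):
    """Cyclic left rotation by k slots (stand-in for a homomorphic rotation)."""
    k %= len(v)
    return v[k:] + v[:k]


def diagonal(M, d):
    """d-th generalized diagonal of a square matrix M."""
    n = len(M)
    return [M[i][(i + d) % n] for i in range(n)]


def matvec_naive(M, v):
    """Diagonal method: one rotation of v per non-zero diagonal index."""
    n = len(v)
    out = [0] * n
    rotations = 0
    for d in range(n):
        rv = rotate(v, d)
        if d:
            rotations += 1
        diag = diagonal(M, d)
        out = [o + a * b for o, a, b in zip(out, diag, rv)]
    return out, rotations


def matvec_bsgs(M, v, n1):
    """Standard BSGS: n1 baby-step rotations are shared by n/n1 giant steps."""
    n = len(v)
    assert n % n1 == 0, "assumption of this sketch: n1 divides n"
    n2 = n // n1
    rotations = 0
    baby = []
    for j in range(n1):
        baby.append(rotate(v, j))
        if j:
            rotations += 1
    out = [0] * n
    for i in range(n2):
        inner = [0] * n
        for j in range(n1):
            # Diagonals are plaintext, so pre-rotating them costs no
            # homomorphic rotation.
            diag = rotate(diagonal(M, i * n1 + j), -i * n1)
            inner = [s + a * b for s, a, b in zip(inner, diag, baby[j])]
        if i:
            inner = rotate(inner, i * n1)
            rotations += 1
        out = [o + x for o, x in zip(out, inner)]
    return out, rotations
```

For an n-slot vector, the naive loop uses n - 1 rotations, while the BSGS loop uses (n1 - 1) + (n/n1 - 1), which is smallest when n1 is close to the square root of n.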
Metadata
- Category: Applications
- Publication info: Preprint.
- Keywords: Convolution, Fully homomorphic encryption, Hierarchical rotation key system, Private artificial intelligence
- Contact author(s): mbyeongseo @ gmail com, jwlee2815 @ cau ac kr
- History
- 2024-06-05: approved
- 2024-06-03: received
- Short URL: https://ia.cr/2024/883
- License: CC BY
BibTeX
@misc{cryptoeprint:2024/883,
  author = {Byeong-Seo Min and Joon-Woo Lee},
  title = {Low-Latency Linear Transformations with Small Key Transmission for Private Neural Network on Homomorphic Encryption},
  howpublished = {Cryptology {ePrint} Archive, Paper 2024/883},
  year = {2024},
  url = {https://eprint.iacr.org/2024/883}
}