Paper 2024/1371
PIGEON: A Framework for Private Inference of Neural Networks
Abstract
Privacy-Preserving Machine Learning is one of the most relevant use cases for Secure Multiparty Computation (MPC). While private training of large neural networks such as VGG-16 or ResNet-50 on state-of-the-art datasets such as Imagenet is still out of reach, given the performance overhead of MPC, private inference is starting to achieve practical runtimes. However, we show that in contrast to plaintext machine learning, the usage of GPU acceleration for both linear and nonlinear neural network layers is actually counterproductive in PPML and leads to performance and scaling penalties. This can be observed by slow ReLU performance, high GPU memory requirements, and inefficient batch processing in state-of-the-art PPML frameworks, which hinders them from scaling to multiple images per second inference throughput and more than eight images per batch on ImageNet. To overcome these limitations, we propose PIGEON, an open-source framework for Private Inference of Neural Networks. PIGEON utilizes a novel ABG programming model that switches between \underline{A}rithmetic vectorization, \underline{B}itslicing, and \underline{G}PU offloading depending on the MPC-specific computation required by each layer. Compared to the state-of-the-art PPML framework Piranha, PIGEON achieves two orders of magnitude improvements in ReLU throughput, reduces peak GPU memory utilization by one order of magnitude, and scales better with large batch size. This translates to one to two orders of magnitude improvements in throughput for large ImageNet batch sizes (e.g. 192) and more than 70\% saturation of a 25 Gbit/s network.
Metadata
- Available format(s)
- Category
- Cryptographic protocols
- Publication info
- Preprint.
- Keywords
- Privacy-Preserving Machine LearningSecure InferenceSecure Multiparty ComputationImplementation
- Contact author(s)
-
christopher harth-kitzerow @ tum de
yongqin @ usc edu
rrajt @ usc edu
carle @ net in tum de
annavara @ usc edu - History
- 2024-09-10: revised
- 2024-09-01: received
- See all versions
- Short URL
- https://ia.cr/2024/1371
- License
-
CC BY
BibTeX
@misc{cryptoeprint:2024/1371, author = {Christopher Harth-Kitzerow and Yongqin Wang and Rachit Rajat and Georg Carle and Murali Annavaram}, title = {{PIGEON}: A Framework for Private Inference of Neural Networks}, howpublished = {Cryptology {ePrint} Archive, Paper 2024/1371}, year = {2024}, url = {https://eprint.iacr.org/2024/1371} }