Paper 2024/1141

Optimized Privacy-Preserving Clustering with Fully Homomorphic Encryption

Chen Yang, Chongqing Key Laboratory of Secure Computing for Biology, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing College, University of Chinese Academy of Sciences
Jingwei Chen, Chongqing Key Laboratory of Secure Computing for Biology, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing College, University of Chinese Academy of Sciences
Wenyuan Wu, Chongqing Key Laboratory of Secure Computing for Biology, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing College, University of Chinese Academy of Sciences
Yong Feng, Chongqing Key Laboratory of Secure Computing for Biology, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing College, University of Chinese Academy of Sciences
Abstract

Clustering is a crucial unsupervised learning method extensively used in the field of data analysis. For analyzing big data, outsourced computation is an effective solution but privacy concerns arise when involving sensitive information. Fully homomorphic encryption (FHE) enables computations on encrypted data, making it ideal for such scenarios. However, existing privacy-preserving clustering based on FHE are often constrained by the high computational overhead incurred from FHE, typically requiring decryption and interactions after only one iteration of the clustering algorithm. In this work, we propose a more efficient approach to evaluate the one-hot vector for the index of the minimum in an array with FHE, which fully exploits the parallelism of single-instruction-multiple-data of FHE schemes. By combining this with FHE bootstrapping, we present a practical FHE-based k-means clustering protocol whose required round of interactions between the data owner and the server is optimal, i.e., accomplishing the entire clustering process on encrypted data in a single round. We implement this protocol using the CKKS FHE scheme. Experiments show that our protocol significantly outperforms the state-of-the-art FHE-based k-means clustering protocols on various public datasets and achieves comparable accuracy to plaintext result. Additionally, We adapt our protocol to support mini-batch k-means for large-scale datasets and report its performance.

Metadata
Available format(s)
PDF
Category
Public-key cryptography
Publication info
Preprint.
Keywords
Clusteringk-meansMini-batch k-meansPrivacy-preservingFully homomorphic encryptionOutsourced computation
Contact author(s)
yangchen @ cigit ac cn
chenjingwei @ cigit ac cn
History
2024-07-15: approved
2024-07-13: received
See all versions
Short URL
https://ia.cr/2024/1141
License
Creative Commons Attribution
CC BY

BibTeX

@misc{cryptoeprint:2024/1141,
      author = {Chen Yang and Jingwei Chen and Wenyuan Wu and Yong Feng},
      title = {Optimized Privacy-Preserving Clustering with Fully Homomorphic Encryption},
      howpublished = {Cryptology ePrint Archive, Paper 2024/1141},
      year = {2024},
      note = {\url{https://eprint.iacr.org/2024/1141}},
      url = {https://eprint.iacr.org/2024/1141}
}
Note: In order to protect the privacy of readers, eprint.iacr.org does not use cookies or embedded third party content.