Paper 2021/825
Balancing Quality and Efficiency in Private Clustering with Affinity Propagation
Hannah Keller, Helen Möllering, Thomas Schneider, and Hossein Yalame
Abstract
In many machine learning applications, training data consists of sensitive information from multiple sources. Privacy-preserving machine learning using secure computation enables multiple parties to compute on their joint data without disclosing their inputs to each other. In this work, we focus on clustering, an unsupervised machine learning technique that partitions data into groups. Previous works on privacy-preserving clustering often leak information and focus on the k-means algorithm, which provides only limited clustering quality and flexibility. Additionally, the number of clusters k must be known in advance. We analyze several prominent clustering algorithms' capabilities and their compatibility with secure computation techniques to create an efficient, fully privacy-preserving clustering implementation superior to k-means. We find affinity propagation to be the most promising candidate and securely implement it using various multi-party computation techniques. Privacy-preserving affinity propagation does not require any input parameters and consists of operations hat are relatively efficient with secure computation. As threat models, we consider passive security as well as active security with an honest and dishonest majority. We offer the first comparison of privacy-preserving clustering between these scenarios, enabling an understanding of the exact trade-offs between them. Based on the clustering quality and the computational and communication costs, privacy-preserving affinity propagation offers a good trade-off between quality and efficiency for practical privacy-preserving clustering.
Metadata
- Available format(s)
- Category
- Applications
- Publication info
- Published elsewhere. Minor revision. SECRYPT'21
- Keywords
- Privacy-preserving Machine LearningClusteringSecure Computation
- Contact author(s)
-
hannah keller @ stud tu-darmstadt de
moellering @ encrypto cs tu-darmstadt de
schneider @ encrypto cs tu-darmstadt de
yalame @ encrypto cs tu-darmstadt de - History
- 2021-07-06: last of 2 revisions
- 2021-06-16: received
- See all versions
- Short URL
- https://ia.cr/2021/825
- License
-
CC BY
BibTeX
@misc{cryptoeprint:2021/825, author = {Hannah Keller and Helen Möllering and Thomas Schneider and Hossein Yalame}, title = {Balancing Quality and Efficiency in Private Clustering with Affinity Propagation}, howpublished = {Cryptology {ePrint} Archive, Paper 2021/825}, year = {2021}, url = {https://eprint.iacr.org/2021/825} }