Cryptology ePrint Archive: Report 2016/111

Scalable and Secure Logistic Regression via Homomorphic Encryption

Yoshinori Aono and Takuya Hayashi and Le Trieu Phong and Lihua Wang

Abstract: Logistic regression is a powerful machine learning tool to classify data. When dealing with sensitive data such as private or medical information, cares are necessary. In this paper, we propose a secure system for protecting both the training and predicting data in logistic regression via homomorphic encryption. Perhaps surprisingly, despite the non-polynomial tasks of training and predicting in logistic regression, we show that only additively homomorphic encryption is needed to build our system. Indeed, we instantiate our system with Paillier, LWE-based, and ring-LWE-based encryption schemes, highlighting the merits and demerits of each instance. Our system is very scalable in both the dataset size and dimension, tolerating big size for example of hundreds of millions ($10^8$s) records. Besides examining the costs of computation and communication, we carefully test our system over real datasets to demonstrate its accuracies and other related measures such as F-score and AUC.

Category / Keywords: applications / logistic regression, homomorphic encryption, Paillier, LWE, ring-LWE, outsourced computation, accuracy, F-score, area under curve

Original Publication (with major differences): IEICE Transactions 99-D(8): 2079-2089 (2016)

Date: received 9 Feb 2016, last revised 31 Mar 2017

Contact author: phong at nict go jp

Available format(s): PDF | BibTeX Citation

Version: 20170331:080318 (All versions of this report)

Short URL:

[ Cryptology ePrint archive ]