Cryptology ePrint Archive: Report 2012/452
EPiC: Efficient Privacy-Preserving Counting for MapReduce
Erik-Oliver Blass and Guevara Noubir and Triet D. Vo-Huu
Abstract: In the face of an untrusted cloud infrastructure, outsourced data
needs to be protected. We present EPiC, a practical protocol for
the privacy-preserving evaluation of a fundamental operation on data
sets: frequency counting. In an encrypted outsourced data
set, a cloud user can specify a pattern, and the cloud will count
the number of occurrences of this pattern in an oblivious
manner. A pattern is expressed as a Boolean formula on the fields
of data records and can specify values counting, range counting, and
conjunctions/disjunctions of field values. We show how a
general pattern, defined by a Boolean formula, is arithmetized
into a multivariate polynomial and used in EPiC.
To increase the performance of the system, we introduce a new somewhat
homomorphic encryption scheme based on a previous work on the Hidden
Modular Group assumption. This scheme is highly
efficient in our particular counting scenario. Besides a formal analysis where
we prove EPiC's privacy, we also present implementation and
evaluation results. We specifically target Google's prominent
MapReduce paradigm as offered by major cloud providers. Our
evaluation performed both locally and in Amazon's public cloud with
data sets sizes of up to 1 TByte shows only modest overhead compared
to non-private counting, attesting to EPiC's efficiency.
Category / Keywords: Cloud Computing, Privacy, Somewhat Homomorphic Encryption, MapReduce
Date: received 8 Aug 2012, last revised 10 Apr 2013
Contact author: vohuudtr at ccs neu edu
Available formats: PDF | BibTeX Citation
Version: 20130411:004652 (All versions of this report)
Discussion forum: Show discussion | Start new discussion
[ Cryptology ePrint archive ]