Cryptology ePrint Archive: Report 2019/1222

Sub-Linear Privacy-Preserving Near-Neighbor Search

M. Sadegh Riazi and Beidi Chen and Anshumali Shrivastava and Dan Wallach and Farinaz Koushanfar

Abstract: In Near-Neighbor Search (NNS), a client queries a database (held by a server) for the most similar data (near-neighbors) given a certain similarity metric. The Privacy-Preserving variant (PP-NNS) requires that neither server nor the client shall learn information about the other party’s data except what can be inferred from the outcome of NNS. The overwhelming growth in the size of current datasets and the lack of a truly secure server in the online world render the existing solutions impractical; either due to their high computational requirements or non-realistic assumptions which potentially compromise privacy. PP-NNS having query time sub-linear in the size of the database has been suggested as an open research direction by Li et al. (CCSW’15). In this paper, we provide the first such algorithm, called Privacy-Preserving Locality Sensitive Indexing (SLSI) which has a sub-linear query time and the ability to handle honest-but-curious parties. At the heart of our proposal lies a secure binary embedding scheme generated from a novel probabilistic transformation over locality sensitive hashing family. We provide information-theoretic bound for the privacy guarantees and support our theoretical claims using substantial empirical evidence on real-world datasets.

Category / Keywords: foundations / Near-neighbor search, privacy, locality sensitive hashing, similarity search

Date: received 17 Oct 2019

Contact author: sadeghriazi at gmail com

Available format(s): PDF | BibTeX Citation

Version: 20191021:081950 (All versions of this report)

Short URL: ia.cr/2019/1222


[ Cryptology ePrint archive ]