Cryptology ePrint Archive: Report 2013/732

SSS-V2: Secure Similarity Search

Hyun-A Park

Abstract: Encrypting information has been regarded as one of the most substantial approaches to protect users’ sensitive information in radically changing internet technology era. In prior research, researchers have considered similarity search over encrypted documents infeasible, because the single-bit difference of a plaintext would result in an enormous bits difference in the corresponding ciphertext. However, we propose a novel idea of Security Similarity Search (SSS) over encrypted documents by applying character-wise encryption with approximate string matching to keyword index search systems. In order to do this, we define the security requirements of similarity search over encrypted data, propose two similarity search schemes, and formally prove the security of the schemes. The first scheme is more efficient, while the second scheme achieves perfect similarity search privacy. Surprisingly, the second scheme turns out to be faster than other keyword index search schemes with keywordwise encryption, while enjoying the same level of security. The schemes of SSS support “like query(‘ab%’)” and a query with misprints in that the character-wise encryption preserves the degree of similarity between two plaintexts, and renders approximate string matching between the corresponding ciphertexts possible without decryption.

Category / Keywords: applications / encrypted data, similarity search,character-wise encryption, approximate string matching

Date: received 7 Nov 2013

Contact author: kokokzi at naver com

Available format(s): PDF | BibTeX Citation

Version: 20131114:060314 (All versions of this report)

Short URL:

[ Cryptology ePrint archive ]