Search CORE

7,183 research outputs found

Exploring Privacy Preservation in Outsourced K-Nearest Neighbors with Multiple Data Owners

Author: Li Frank
Paxson Vern
Shin Richard
Publication venue
Publication date: 29/07/2015
Field of study

The k-nearest neighbors (k-NN) algorithm is a popular and effective classification algorithm. Due to its large storage and computational requirements, it is suitable for cloud outsourcing. However, k-NN is often run on sensitive data such as medical records, user images, or personal information. It is important to protect the privacy of data in an outsourced k-NN system. Prior works have all assumed the data owners (who submit data to the outsourced k-NN system) are a single trusted party. However, we observe that in many practical scenarios, there may be multiple mutually distrusting data owners. In this work, we present the first framing and exploration of privacy preservation in an outsourced k-NN system with multiple data owners. We consider the various threat models introduced by this modification. We discover that under a particularly practical threat model that covers numerous scenarios, there exists a set of adaptive attacks that breach the data privacy of any exact k-NN system. The vulnerability is a result of the mathematical properties of k-NN and its output. Thus, we propose a privacy-preserving alternative system supporting kernel density estimation using a Gaussian kernel, a classification algorithm from the same family as k-NN. In many applications, this similar algorithm serves as a good substitute for k-NN. We additionally investigate solutions for other threat models, often through extensions on prior single data owner systems

arXiv.org e-Print Archive

CiteSeerX

SANNS: Scaling Up Secure Approximate k-Nearest Neighbors Search

Author: Chen Hao
Chillotti Ilaria
Dong Yihe
Poburinnaya Oxana
Razenshteyn Ilya
Riazi M. Sadegh
Publication venue
Publication date: 08/03/2020
Field of study

The

k

-Nearest Neighbor Search (

k

-NNS) is the backbone of several cloud-based services such as recommender systems, face recognition, and database search on text and images. In these services, the client sends the query to the cloud server and receives the response in which case the query and response are revealed to the service provider. Such data disclosures are unacceptable in several scenarios due to the sensitivity of data and/or privacy laws. In this paper, we introduce SANNS, a system for secure

k

-NNS that keeps client's query and the search result confidential. SANNS comprises two protocols: an optimized linear scan and a protocol based on a novel sublinear time clustering-based algorithm. We prove the security of both protocols in the standard semi-honest model. The protocols are built upon several state-of-the-art cryptographic primitives such as lattice-based additively homomorphic encryption, distributed oblivious RAM, and garbled circuits. We provide several contributions to each of these primitives which are applicable to other secure computation tasks. Both of our protocols rely on a new circuit for the approximate top-

k

selection from

n

numbers that is built from

O(n + k^2)

comparators. We have implemented our proposed system and performed extensive experimental results on four datasets in two different computation environments, demonstrating more than

18-31\times

faster response time compared to optimally implemented protocols from the prior work. Moreover, SANNS is the first work that scales to the database of 10 million entries, pushing the limit by more than two orders of magnitude.Comment: 18 pages, to appear at USENIX Security Symposium 202

arXiv.org e-Print Archive

Cryptology ePrint Archive

Lower Bounds for Oblivious Near-Neighbor Search

Author: Larsen Kasper Green
Malkin Tal
Weinstein Omri
Yeo Kevin
Publication venue
Publication date: 09/04/2019
Field of study

We prove an

\Omega(d \lg n/ (\lg\lg n)^2)

lower bound on the dynamic cell-probe complexity of statistically

\mathit{oblivious}

approximate-near-neighbor search (

\mathsf{ANN}

) over the

d

-dimensional Hamming cube. For the natural setting of

d = \Theta(\log n)

, our result implies an

\tilde{\Omega}(\lg^2 n)

lower bound, which is a quadratic improvement over the highest (non-oblivious) cell-probe lower bound for

\mathsf{ANN}

. This is the first super-logarithmic

\mathit{unconditional}

lower bound for

\mathsf{ANN}

against general (non black-box) data structures. We also show that any oblivious

\mathit{static}

data structure for decomposable search problems (like

\mathsf{ANN}

) can be obliviously dynamized with

O(\log n)

overhead in update and query time, strengthening a classic result of Bentley and Saxe (Algorithmica, 1980).Comment: 28 page

arXiv.org e-Print Archive

Crossref

Cryptology ePrint Archive

Preventing Location-Based Identity Inference in Anonymous Spatial Queries

Author: GHINITA Gabriel
KALNIS Panos
MOURATIDIS Kyriakos
PAPADIAS Dimitris
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2007
Field of study

The increasing trend of embedding positioning capabilities (for example, GPS) in mobile devices facilitates the widespread use of Location-Based Services. For such applications to succeed, privacy and confidentiality are essential. Existing privacy-enhancing techniques rely on encryption to safeguard communication channels, and on pseudonyms to protect user identities. Nevertheless, the query contents may disclose the physical location of the user. In this paper, we present a framework for preventing location-based identity inference of users who issue spatial queries to Location-Based Services. We propose transformations based on the well-established K-anonymity concept to compute exact answers for range and nearest neighbor search, without revealing the query source. Our methods optimize the entire process of anonymizing the requests and processing the transformed spatial queries. Extensive experimental studies suggest that the proposed techniques are applicable to real-life scenarios with numerous mobile users

Institutional Knowledge at Singapore Management University

Hong Kong University of Science and Technology Institutional Repository

ScholarBank@NUS

A Comparison of Blocking Methods for Record Linkage

Author: A. Goldenberg
D. Vatsalan
H. Liang
L. Paulevé
M. Kuzu
P. Christen
P. Christen
P. Christen
R. Hall
S. Fortunato
T. Herzog
Publication venue
Publication date: 01/01/2014
Field of study

Record linkage seeks to merge databases and to remove duplicates when unique identifiers are not available. Most approaches use blocking techniques to reduce the computational complexity associated with record linkage. We review traditional blocking techniques, which typically partition the records according to a set of field attributes, and consider two variants of a method known as locality sensitive hashing, sometimes referred to as "private blocking." We compare these approaches in terms of their recall, reduction ratio, and computational complexity. We evaluate these methods using different synthetic datafiles and conclude with a discussion of privacy-related issues.Comment: 22 pages, 2 tables, 7 figure

arXiv.org e-Print Archive

Crossref