Search CORE

4 research outputs found

Eclipse: Practicability Beyond kNN and Skyline

Author: Liu Jinfei
Luo Jun
Pei Jian
Xiong Li
Zhang Qiuchen
Publication venue
Publication date: 16/10/2018
Field of study

The

k

nearest neighbor (

k

NN) query is a fundamental problem in databases. Given a set of multidimensional data points and a query point,

k

NN returns the

k

nearest neighbors based on a scoring function such as weighted sum given an attribute weight vector. However, the attribute weight vector can be difficult to specify in practice. Skyline returns the points including all possible nearest neighbors without requiring the exact attribute weight vector or a scoring function but the number of returned points can be prohibitively large for practical use. In this paper, we propose a novel \emph{eclipse} definition which provides a more flexible and customizable definition than the classic

1

NN and skyline. In eclipse, users can specify a range of attribute weights and control the number of returned points. We show that both

1

NN and skyline are instantiations of eclipse. To compute eclipse points, we propose a baseline algorithm with time complexity of

O(n^22^{d-1})

, and an improved

O(n\log ^{d-1}n)

time transformation-based algorithm by transforming the eclipse problem to the skyline problem, where

n

is the number of points and

d

is the number of dimensions. Furthermore, we propose a novel index-based algorithm utilizing duality transform with much better efficiency. The experimental results on the real NBA dataset and the synthetic datasets demonstrate the effectiveness and efficiency of our eclipse algorithms

arXiv.org e-Print Archive

Skyline Diagram: Efficient Space Partitioning for Skyline Queries

Author: Fan Chenglin
Guo Yuzhang
Liu Jinfei
Luo Jun
Ma Shuaicheng
Pei Jian
Xiong Li
Yang Juncheng
Publication venue
Publication date: 04/12/2018
Field of study

Skyline queries are important in many application domains. In this paper, we propose a novel structure Skyline Diagram, which given a set of points, partitions the plane into a set of regions, referred to as skyline polyominos. All query points in the same skyline polyomino have the same skyline query results. Similar to

k^{th}

-order Voronoi diagram commonly used to facilitate

k

nearest neighbor (

k

NN) queries, skyline diagram can be used to facilitate skyline queries and many other applications. However, it may be computationally expensive to build the skyline diagram. By exploiting some interesting properties of skyline, we present several efficient algorithms for building the diagram with respect to three kinds of skyline queries, quadrant, global, and dynamic skylines. In addition, we propose an approximate skyline diagram which can significantly reduce the space cost. Experimental results on both real and synthetic datasets show that our algorithms are efficient and scalable

arXiv.org e-Print Archive

Eclipse: Generalizing kNN and Skyline

Author: Liu Jinfei
Luo Jun
Pei Jian
Xiong Li
Zhang Qiuchen
Publication venue
Publication date: 14/06/2019
Field of study

k

nearest neighbor (

k

NN) queries and skyline queries are important operators on multi-dimensional data points. Given a query point,

k

NN query returns the

k

nearest neighbors based on a scoring function such as a weighted sum of the attributes, which requires predefined attribute weights (or preferences). Skyline query returns all possible nearest neighbors for any monotonic scoring functions without requiring attribute weights but the number of returned points can be prohibitively large. We observe that both

k

NN and skyline are inflexible and cannot be easily customized. In this paper, we propose a novel \emph{eclipse} operator that generalizes the classic

1

NN and skyline queries and provides a more flexible and customizable query solution for users. In eclipse, users can specify rough and customizable attribute preferences and control the number of returned points. We show that both

1

NN and skyline are instantiations of eclipse. To process eclipse queries, we propose a baseline algorithm with time complexity

O(n^22^{d-1})

, and an improved

O(n\log ^{d-1}n)

time transformation-based algorithm, where

n

is the number of points and

d

arXiv.org e-Print Archive

Secure and Efficient Skyline Queries on Encrypted Data

Author: Liu Jinfei
Pei Jian
Xiong Li
Yang Juncheng
Publication venue
Publication date: 04/06/2018
Field of study

Outsourcing data and computation to cloud server provides a cost-effective way to support large scale data storage and query processing. However, due to security and privacy concerns, sensitive data (e.g., medical records) need to be protected from the cloud server and other unauthorized users. One approach is to outsource encrypted data to the cloud server and have the cloud server perform query processing on the encrypted data only. It remains a challenging task to support various queries over encrypted data in a secure and efficient way such that the cloud server does not gain any knowledge about the data, query, and query result. In this paper, we study the problem of secure skyline queries over encrypted data. The skyline query is particularly important for multi-criteria decision making but also presents significant challenges due to its complex computations. We propose a fully secure skyline query protocol on data encrypted using semantically-secure encryption. As a key subroutine, we present a new secure dominance protocol, which can be also used as a building block for other queries. Furthermore, we demonstrate two optimizations, data partitioning and lazy merging, to further reduce the computation load. Finally, we provide both serial and parallelized implementations and empirically study the protocols in terms of efficiency and scalability under different parameter settings, verifying the feasibility of our proposed solutions.Comment: 16 page

arXiv.org e-Print Archive