Search CORE

11,839 research outputs found

Continuous Nearest Neighbor Queries over Sliding Windows

Author: MOURATIDIS Kyriakos
Papadias Dimitris
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2007
Field of study

Abstract—This paper studies continuous monitoring of nearest neighbor (NN) queries over sliding window streams. According to this model, data points continuously stream in the system, and they are considered valid only while they belong to a sliding window that contains 1) the W most recent arrivals (count-based) or 2) the arrivals within a fixed interval W covering the most recent time stamps (time-based). The task of the query processor is to constantly maintain the result of long-running NN queries among the valid data. We present two processing techniques that apply to both count-based and time-based windows. The first one adapts conceptual partitioning, the best existing method for continuous NN monitoring over update streams, to the sliding window model. The second technique reduces the problem to skyline maintenance in the distance-time space and precomputes the future changes in the NN set. We analyze the performance of both algorithms and extend them to variations of NN search. Finally, we compare their efficiency through a comprehensive experimental evaluation. The skyline-based algorithm achieves lower CPU cost, at the expense of slightly larger space overhead. Index Terms—Location-dependent and sensitive, spatial databases, query processing, nearest neighbors, data streams, sliding windows.

CiteSeerX

Crossref

Institutional Knowledge at Singapore Management University

Hong Kong University of Science and Technology Institutional Repository

Lower Bounds for Oblivious Near-Neighbor Search

Author: Larsen Kasper Green
Malkin Tal
Weinstein Omri
Yeo Kevin
Publication venue
Publication date: 09/04/2019
Field of study

We prove an

\Omega(d \lg n/ (\lg\lg n)^2)

lower bound on the dynamic cell-probe complexity of statistically

\mathit{oblivious}

approximate-near-neighbor search (

\mathsf{ANN}

) over the

d

-dimensional Hamming cube. For the natural setting of

d = \Theta(\log n)

, our result implies an

\tilde{\Omega}(\lg^2 n)

lower bound, which is a quadratic improvement over the highest (non-oblivious) cell-probe lower bound for

\mathsf{ANN}

. This is the first super-logarithmic

\mathit{unconditional}

lower bound for

\mathsf{ANN}

against general (non black-box) data structures. We also show that any oblivious

\mathit{static}

data structure for decomposable search problems (like

\mathsf{ANN}

) can be obliviously dynamized with

O(\log n)

overhead in update and query time, strengthening a classic result of Bentley and Saxe (Algorithmica, 1980).Comment: 28 page

arXiv.org e-Print Archive

Crossref

Cryptology ePrint Archive

K-nearest neighbor search for fuzzy objects

Author: Fung Pui Cheong
Zheng Kai
Zhou Xiaofang
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2010
Field of study

The K-Nearest Neighbor search (kNN) problem has been investigated extensively in the past due to its broad range of applications. In this paper we study this problem in the context of fuzzy objects that have indeterministic boundaries. Fuzzy objects play an important role in many areas, such as biomedical image databases and GIS. Existing research on fuzzy objects mainly focuses on modelling basic fuzzy object types and operations, leaving the processing of more advanced queries such as kNN query untouched. In this paper, we propose two new kinds of kNN queries for fuzzy objects, Ad-hoc kNN query (AKNN) and Range kNN query (RKNN), to find the k nearest objects qualifying at a probability threshold or within a probability range. For efficient AKNN query processing, we optimize the basic best-first search algorithm by deriving more accurate approximations for the distance function between fuzzy objects and the query object. To improve the performance of RKNN search, effective pruning rules are developed to significantly reduce the search space and further speed up the candidate refinement process. The efficiency of our proposed algorithms as well as the optimization techniques are verified with an extensive set of experiments using both synthetic and real datasets

Crossref

University of Queensland eSpace