Analysis of approximate nearest neighbor searching with clustered point
  sets

Maneewongvatana, Songrit; Mount, David M.

research

Analysis of approximate nearest neighbor searching with clustered point sets

Authors: Songrit Maneewongvatana
David M. Mount
Publication date: 1 January 1999
Publisher

Abstract

We present an empirical analysis of data structures for approximate nearest neighbor searching. We compare the well-known optimized kd-tree splitting method against two alternative splitting methods. The first, called the sliding-midpoint method, which attempts to balance the goals of producing subdivision cells of bounded aspect ratio, while not producing any empty cells. The second, called the minimum-ambiguity method is a query-based approach. In addition to the data points, it is also given a training set of query points for preprocessing. It employs a simple greedy algorithm to select the splitting plane that minimizes the average amount of ambiguity in the choice of the nearest neighbor for the training points. We provide an empirical analysis comparing these two methods against the optimized kd-tree construction for a number of synthetically generated data and query sets. We demonstrate that for clustered data and query sets, these algorithms can provide significant improvements over the standard kd-tree construction for approximate nearest neighbor searching.Comment: 20 pages, 8 figures. Presented at ALENEX '99, Baltimore, MD, Jan 15-16, 199

Similar works

Full text

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.7.685...

Last time updated on 22/10/2014