Maximum Inner-Product Search using Tree Data-structures
The problem of {\em efficiently} finding the best match for a query in a
given set with respect to the Euclidean distance or the cosine similarity has
been extensively studied in the literature. However, the closely related problem of
efficiently finding the best match with respect to the inner product has never
been explored in the general setting to the best of our knowledge. In this
paper we consider this general problem and contrast it with the existing
best-match algorithms. First, we propose a general branch-and-bound algorithm
using a tree data structure. Subsequently, we present a dual-tree algorithm for
the case where there are multiple queries. Finally we present a new data
structure for increasing the efficiency of the dual-tree algorithm. These
branch-and-bound algorithms involve novel bounds suited for the purpose of
best-matching with inner products. We evaluate our proposed algorithms on a
variety of data sets from various applications, and exhibit up to five orders
of magnitude improvement in query time over the naive search technique.
Comment: Under submission in KDD 201
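The node bounds at the heart of such a branch-and-bound search can be illustrated with the standard ball bound that follows from the Cauchy-Schwarz inequality: the inner product of a query with any point inside a ball is at most the inner product with the ball's center plus the query norm times the radius. This is a minimal sketch under that assumption; the paper's actual tree bounds may be tighter, and all names here are illustrative.

```python
import numpy as np

def mip_upper_bound(q, center, radius):
    """Upper bound on max_{p in ball(center, radius)} <q, p>.

    By Cauchy-Schwarz:
      <q, p> = <q, center> + <q, p - center>
            <= <q, center> + ||q|| * ||p - center||
            <= <q, center> + ||q|| * radius.
    A branch-and-bound search prunes a tree node whenever this bound
    falls below the best inner product found so far.
    """
    return q @ center + np.linalg.norm(q) * radius

rng = np.random.default_rng(0)
points = rng.normal(size=(100, 5))   # toy data set
q = rng.normal(size=5)               # toy query

# Treat the whole data set as one tree node (one bounding ball).
center = points.mean(axis=0)
radius = np.max(np.linalg.norm(points - center, axis=1))

best = np.max(points @ q)                      # exact best inner product
bound = mip_upper_bound(q, center, radius)     # never smaller than `best`
```

A real implementation would apply the same bound recursively to the children of each tree node, descending into a node only if its bound exceeds the current best match.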
A new Lenstra-type Algorithm for Quasiconvex Polynomial Integer Minimization with Complexity 2^O(n log n)
We study the integer minimization of a quasiconvex polynomial with
quasiconvex polynomial constraints. We propose a new algorithm that is an
improvement upon the best known algorithm due to Heinz (Journal of Complexity,
2005). This improvement is achieved by applying a modern Lenstra-type
algorithm, finding optimal ellipsoid roundings, and considering sparse
encodings of polynomials. For the bounded case, our algorithm attains a
time-complexity of s (r l M d)^{O(1)} 2^{2n log_2(n) + O(n)} when M is a bound
on the number of monomials in each polynomial and r is the binary encoding
length of a bound on the feasible region. In the general case, the
time-complexity is s l^{O(1)} d^{O(n)} 2^{2n log_2(n) + O(n)}. In each case we
assume d >= 2 is a bound on the total degree of the polynomials and l bounds
the maximum binary encoding size of the input.
Comment: 28 pages, 10 figures
GOGMA: Globally-Optimal Gaussian Mixture Alignment
Gaussian mixture alignment is a family of approaches that are frequently used
for robustly solving the point-set registration problem. However, since they
use local optimisation, they are susceptible to local minima and can only
guarantee local optimality. Consequently, their accuracy is strongly dependent
on the quality of the initialisation. This paper presents the first
globally-optimal solution to the 3D rigid Gaussian mixture alignment problem
under the L2 distance between mixtures. The algorithm, named GOGMA, employs a
branch-and-bound approach to search the space of 3D rigid motions SE(3),
guaranteeing global optimality regardless of the initialisation. The geometry
of SE(3) was used to find novel upper and lower bounds for the objective
function and local optimisation was integrated into the scheme to accelerate
convergence without voiding the optimality guarantee. The evaluation
empirically supported the optimality proof and showed that the method performed
much more robustly on two challenging datasets than an existing
globally-optimal registration solution.
Comment: Manuscript in press, 2016 IEEE Conference on Computer Vision and
Pattern Recognition
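The L2 distance between Gaussian mixtures that such methods minimise has a closed form, because the integral of a product of two Gaussians is itself a Gaussian density evaluated at the difference of the means. Below is a minimal sketch of that objective for the simplified case of equal weights and a single shared isotropic bandwidth (both simplifying assumptions of this sketch, not details taken from the paper):

```python
import numpy as np

def gauss_overlap(mu1, mu2, var1, var2):
    """Closed-form integral of a product of two isotropic Gaussians:
    int N(x; mu1, var1*I) N(x; mu2, var2*I) dx = N(mu1; mu2, (var1+var2)*I).
    """
    d = len(mu1)
    v = var1 + var2
    diff = mu1 - mu2
    return (2 * np.pi * v) ** (-d / 2) * np.exp(-(diff @ diff) / (2 * v))

def gmm_l2_distance(mus_f, mus_g, var=0.1):
    """L2 distance between two equally weighted isotropic GMMs:
    ||f - g||^2 = int f^2 - 2 int f*g + int g^2, expanded pairwise."""
    def cross(A, B):
        w = 1.0 / (len(A) * len(B))
        return sum(w * gauss_overlap(a, b, var, var) for a in A for b in B)
    return cross(mus_f, mus_f) - 2 * cross(mus_f, mus_g) + cross(mus_g, mus_g)

pts = np.random.default_rng(1).normal(size=(5, 3))
d_same = gmm_l2_distance(pts, pts)          # identical mixtures -> 0
d_shift = gmm_l2_distance(pts, pts + 1.0)   # translated copy -> positive
```

In a registration setting, one mixture would be transformed by a candidate rigid motion from SE(3) before evaluating this objective, and the branch-and-bound search would bound its value over regions of the motion space.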
Optimal Data-Dependent Hashing for Approximate Near Neighbors
We show an optimal data-dependent hashing scheme for the approximate near
neighbor problem. For an n-point data set in a d-dimensional space our data
structure achieves query time d n^{rho + o(1)} and space d n + n^{1 + rho + o(1)},
where rho = 1/(2c^2 - 1) for the Euclidean space and approximation factor c.
For the Hamming space, we obtain an exponent of rho = 1/(2c - 1).
Our result completes the direction set forth in [AINR14], who gave a
proof-of-concept that data-dependent hashing can outperform classical Locality
Sensitive Hashing (LSH). In contrast to [AINR14], the new bound is not only
optimal, but in fact improves over the best (optimal) LSH data structures
[IM98, AI06] for all approximation factors c > 1.
From the technical perspective, we proceed by decomposing an arbitrary
dataset into several subsets that are, in a certain sense, pseudo-random.
Comment: 36 pages, 5 figures; an extended abstract appeared in the proceedings
of the 47th ACM Symposium on Theory of Computing (STOC 2015)
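The classical LSH baseline that this abstract contrasts against can be illustrated for the Hamming space with the simple bit-sampling family: hash a point by keeping k randomly chosen coordinates, so that two points collide on each sampled bit with probability 1 - dist(x, y)/d. This is a minimal sketch with illustrative parameters, not the data-dependent construction of the paper.

```python
import numpy as np

def bit_sample_hash(v, idx):
    """Bit-sampling LSH for Hamming space: keep the k sampled coordinates.
    Nearby points agree on most bits, so they collide far more often."""
    return tuple(v[idx])

rng = np.random.default_rng(2)
d, k, trials = 64, 8, 2000

x = rng.integers(0, 2, size=d)
near = x.copy()
near[:2] ^= 1                           # Hamming distance 2 from x
far = x ^ rng.integers(0, 2, size=d)    # distance ~ d/2 in expectation

def collision_rate(a, b):
    """Empirical probability that a and b fall in the same hash bucket."""
    hits = 0
    for _ in range(trials):
        idx = rng.choice(d, size=k, replace=False)
        hits += bit_sample_hash(a, idx) == bit_sample_hash(b, idx)
    return hits / trials

p_near = collision_rate(x, near)   # roughly (1 - 2/d)^k, close to 1
p_far = collision_rate(x, far)     # roughly (1/2)^k, close to 0
```

A full index would build several hash tables from independent samples of coordinates and probe only the buckets the query falls into; data-dependent schemes improve on this by choosing the hash family after seeing the data set.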