Search CORE

6 research outputs found

Approximate Nearest Neighbor Search Amid Higher-Dimensional Flats

Author: Agarwal Pankaj K.
Rubin Natan
Sharir Micha
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 25th Annual European Symposium on Algorithms (ESA 2017)
Publication date: 01/01/2017
Field of study

We consider the Approximate Nearest Neighbor (ANN) problem where the input set consists of n k-flats in the Euclidean Rd, for any fixed parameters k 0 is another prespecified parameter. We present an algorithm that achieves this task with n^{k+1}(log(n)/epsilon)^O(1) storage and preprocessing (where the constant of proportionality in the big-O notation depends on d), and can answer a query in O(polylog(n)) time (where the power of the logarithm depends on d and k). In particular, we need only near-quadratic storage to answer ANN queries amidst a set of n lines in any fixed-dimensional Euclidean space. As a by-product, our approach also yields an algorithm, with similar performance bounds, for answering exact nearest neighbor queries amidst k-flats with respect to any polyhedral distance function. Our results are more general, in that they also provide a tradeoff between storage and query time

Dagstuhl Research Online Publication Server

Approximate Sparse Linear Regression

Author: Har-Peled Sariel
Indyk Piotr
Mahabadi Sepideh
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 45th International Colloquium on Automata, Languages, and Programming (ICALP 2018)
Publication date: 01/01/2018
Field of study

In the Sparse Linear Regression (SLR) problem, given a d x n matrix M and a d-dimensional query q, the goal is to compute a k-sparse n-dimensional vector tau such that the error ||M tau - q|| is minimized. This problem is equivalent to the following geometric problem: given a set P of n points and a query point q in d dimensions, find the closest k-dimensional subspace to q, that is spanned by a subset of k points in P. In this paper, we present data-structures/algorithms and conditional lower bounds for several variants of this problem (such as finding the closest induced k dimensional flat/simplex instead of a subspace). In particular, we present approximation algorithms for the online variants of the above problems with query time O~(n^{k-1}), which are of interest in the "low sparsity regime" where k is small, e.g., 2 or 3. For k=d, this matches, up to polylogarithmic factors, the lower bound that relies on the affinely degenerate conjecture (i.e., deciding if n points in R^d contains d+1 points contained in a hyperplane takes Omega(n^d) time). Moreover, our algorithms involve formulating and solving several geometric subproblems, which we believe to be of independent interest

arXiv.org e-Print Archive

DSpace@MIT

Dagstuhl Research Online Publication Server

Sparse Regression via Range Counting

Author: Cardinal Jean
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 17th Scandinavian Symposium and Workshops on Algorithm Theory (SWAT 2020)
Publication date: 01/01/2020
Field of study

The sparse regression problem, also known as best subset selection problem, can be cast as follows: Given a set S of n points in ?^d, a point y? ?^d, and an integer 2 ? k ? d, find an affine combination of at most k points of S that is nearest to y. We describe a O(n^{k-1} log^{d-k+2} n)-time randomized (1+?)-approximation algorithm for this problem with d and ? constant. This is the first algorithm for this problem running in time o(n^k). Its running time is similar to the query time of a data structure recently proposed by Har-Peled, Indyk, and Mahabadi (ICALP\u2718), while not requiring any preprocessing. Up to polylogarithmic factors, it matches a conditional lower bound relying on a conjecture about affine degeneracy testing. In the special case where k = d = O(1), we provide a simple O_?(n^{d-1+?})-time deterministic exact algorithm, for any ? > 0. Finally, we show how to adapt the approximation algorithm for the sparse linear regression and sparse convex regression problems with the same running time, up to polylogarithmic factors

Dagstuhl Research Online Publication Server

Approximate Nearest-Neighbor Search for Line Segments

Author: Abdelkader Ahmed
Mount David M.
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 37th International Symposium on Computational Geometry (SoCG 2021)
Publication date: 01/01/2021
Field of study

Approximate nearest-neighbor search is a fundamental algorithmic problem that continues to inspire study due its essential role in numerous contexts. In contrast to most prior work, which has focused on point sets, we consider nearest-neighbor queries against a set of line segments in

\mathbb{R}^d

, for constant dimension

d

. Given a set

S

n

disjoint line segments in

\mathbb{R}^d

and an error parameter

\varepsilon > 0

, the objective is to build a data structure such that for any query point

q

, it is possible to return a line segment whose Euclidean distance from

q

is at most

(1+\varepsilon)

times the distance from

q

to its nearest line segment. We present a data structure for this problem with storage

O((n^2/\varepsilon^{d}) \log (\Delta/\varepsilon))

and query time

O(\log (\max(n,\Delta)/\varepsilon))

, where

\Delta

is the spread of the set of segments

S

. Our approach is based on a covering of space by anisotropic elements, which align themselves according to the orientations of nearby segments.Comment: 20 pages (including appendix), 5 figure

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server