2 research outputs found

    Geometric group testing

    Group testing is concerned with identifying $t$ defective items in a set of $m$ items, where each test reports whether a specific subset of items contains at least one defective. In non-adaptive group testing, the subsets to be tested are fixed in advance. By testing multiple items at once, the required number of tests can be made much smaller than $m$. In fact, for $t \in \mathcal{O}(1)$, the optimal number of (non-adaptive) tests is known to be $\Theta(\log m)$. In this paper, we consider the problem of non-adaptive group testing in a geometric setting, where the items are points in $d$-dimensional Euclidean space and the tests are axis-parallel boxes (hyperrectangles). We present upper and lower bounds on the required number of tests under this geometric constraint. In contrast to the general, combinatorial case, the bounds in our geometric setting are polynomial in $m$. For instance, our results imply that identifying a defective pair in a set of $m$ points in the plane always requires $\Omega(m^{3/5})$ tests, and there exist configurations of $m$ points for which $\mathcal{O}(m^{2/3})$ tests are sufficient, whereas to identify a single defective point in the plane, $\Theta(m^{1/2})$ tests are always necessary and sometimes sufficient.
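    The $\Theta(\log m)$ bound for a constant number of defectives mentioned in the abstract can be illustrated (this sketch is not from the paper, which concerns the geometric variant) by the classic binary-encoding scheme for a single defective: pool $i$ contains exactly the items whose index has bit $i$ set, so the pattern of positive tests spells out the defective's index in binary.

    ```python
    # Illustrative sketch: identifying a single defective among m items
    # with ceil(log2 m) non-adaptive group tests via binary encoding.
    import math

    def make_tests(m):
        """Return ceil(log2 m) pools; pool i holds the items with bit i set."""
        k = max(1, math.ceil(math.log2(m)))
        return [[x for x in range(m) if (x >> i) & 1] for i in range(k)]

    def identify(m, is_defective):
        """Recover the unique defective index from the test outcomes."""
        index = 0
        for i, pool in enumerate(make_tests(m)):
            # each test reports whether its pool contains a defective
            if any(is_defective(x) for x in pool):
                index |= 1 << i
        return index

    # Example: 100 items, item 42 defective, recovered with only 7 tests.
    print(identify(100, lambda x: x == 42))  # -> 42
    ```

    All pools are fixed before any outcome is seen, which is exactly the non-adaptive constraint; the geometric setting of the paper additionally requires each pool to be the set of points inside an axis-parallel box, which is why its bounds become polynomial in $m$.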

    On Adaptive Distance Estimation

    We provide a static data structure for distance estimation which supports {\it adaptive} queries. Concretely, given a dataset $X = \{x_i\}_{i=1}^n$ of $n$ points in $\mathbb{R}^d$ and $0 < p \leq 2$, we construct a randomized data structure with low memory consumption and query time which, when later given any query point $q \in \mathbb{R}^d$, outputs a $(1+\epsilon)$-approximation of $\lVert q - x_i \rVert_p$ with high probability for all $i \in [n]$. The main novelty is that our data structure's correctness guarantee holds even when the sequence of queries is chosen adaptively: an adversary is allowed to choose the $j$th query point $q_j$ in a way that depends on the answers reported by the data structure for $q_1, \ldots, q_{j-1}$. Previous randomized Monte Carlo methods do not provide error guarantees in the setting of adaptively chosen queries. Our memory consumption is $\tilde{O}((n+d)d/\epsilon^2)$, slightly more than the $O(nd)$ required to store $X$ in memory explicitly, but with the benefit that our time to answer queries is only $\tilde{O}(\epsilon^{-2}(n+d))$, much faster than the naive $\Theta(nd)$ time obtained from a linear scan when $n$ and $d$ are very large. Here $\tilde{O}$ hides $\log(nd/\epsilon)$ factors. We discuss applications to nearest neighbor search and nonparametric estimation. Our method is simple and likely to be applicable to other domains: we describe a generic approach for transforming randomized Monte Carlo data structures which do not support adaptive queries into ones that do, and show that for the problem at hand, it can be applied to standard nonadaptive solutions to $\ell_p$ norm estimation with negligible overhead in query time and a factor $d$ overhead in memory.
    Comment: Minor correction in proof of Lemma B.
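    The "standard nonadaptive solution" that the abstract builds on can be sketched (for $p = 2$; this is an illustration, not the paper's construction) as distance estimation via a random linear sketch: project every point through a random matrix $S$ at preprocessing time, then answer a query by comparing sketches. For any one *fixed* query this preserves distances to $(1 \pm \epsilon)$ with high probability, but an adversary who adapts queries to past answers can learn $S$ and break the guarantee, which is the gap the paper closes.

    ```python
    # Illustrative sketch: non-adaptive Monte Carlo l2-distance estimation
    # via random projection (Johnson-Lindenstrauss). Parameters n, d, k,
    # eps below are demo choices, not the paper's.
    import numpy as np

    rng = np.random.default_rng(0)
    n, d, k, eps = 200, 50, 600, 0.25           # k ~ O(log(n)/eps^2) rows
    X = rng.standard_normal((n, d))             # dataset of n points in R^d
    S = rng.standard_normal((k, d)) / np.sqrt(k)  # random sketch matrix
    SX = X @ S.T                                # preprocess: sketch each point

    def estimate_distances(q):
        """Approximate ||q - x_i||_2 for all i in O(kd + nk) time."""
        Sq = S @ q
        return np.linalg.norm(SX - Sq, axis=1)

    q = rng.standard_normal(d)
    true = np.linalg.norm(X - q, axis=1)
    est = estimate_distances(q)
    print(np.max(np.abs(est - true) / true))    # typically well below eps
    ```

    Note the per-query cost $O(kd + nk)$ already beats the naive $\Theta(nd)$ scan when $k \ll \min(n, d)$; the guarantee here, however, is only for queries fixed independently of $S$, which is precisely the non-adaptive limitation discussed in the abstract.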