
    Cross-Sender Bit-Mixing Coding

    Full text link
    Scheduling to avoid packet collisions is a long-standing challenge in networking, and has become even trickier in wireless networks with multiple senders and multiple receivers. In fact, researchers have proved that even {\em perfect} scheduling can only achieve $\mathbf{R} = O(\frac{1}{\ln N})$. Here $N$ is the number of nodes in the network, and $\mathbf{R}$ is the {\em medium utilization rate}. Ideally, one would hope to achieve $\mathbf{R} = \Theta(1)$, while avoiding all the complexities in scheduling. To this end, this paper proposes {\em cross-sender bit-mixing coding} ({\em BMC}), which does not rely on scheduling. Instead, users transmit simultaneously on suitably-chosen slots, and the amount of overlap in different users' slots is controlled via coding. We prove that in all possible network topologies, using BMC enables us to achieve $\mathbf{R}=\Theta(1)$. We also prove that the space and time complexities of BMC encoding/decoding are all low-order polynomials.
    Comment: Published in the International Conference on Information Processing in Sensor Networks (IPSN), 2019
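
    The flavor of "overlap controlled via coding" can be illustrated with a toy random superimposed code: each sender owns a random subset of slots, the channel reveals only which slots are occupied, and a sender is decoded as active exactly when all of its slots are occupied. This is a hedged sketch with made-up parameters, not the paper's actual BMC construction.

        # Toy superimposed-code sketch (NOT the paper's BMC construction):
        # senders share the channel without scheduling, and coding keeps the
        # overlap between their slot sets decodable.
        import random

        def random_codebook(num_senders, num_slots, p=0.1, seed=0):
            rng = random.Random(seed)
            return [frozenset(s for s in range(num_slots) if rng.random() < p)
                    for _ in range(num_senders)]

        def superimpose(codebook, active):
            # The receiver only sees which slots carry energy (bitwise OR).
            occupied = set()
            for i in active:
                occupied |= codebook[i]
            return occupied

        def decode(codebook, occupied):
            # Declare sender i active iff all of its slots are occupied.
            return {i for i, slots in enumerate(codebook) if slots <= occupied}

        codebook = random_codebook(num_senders=50, num_slots=400)
        active = {3, 17, 42}
        print(decode(codebook, superimpose(codebook, active)))
        # With these parameters the true set {3, 17, 42} is typically recovered,
        # though a random codebook can occasionally yield false positives.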

    Approximate Near Neighbors for General Symmetric Norms

    Full text link
    We show that every symmetric normed space admits an efficient nearest neighbor search data structure with doubly-logarithmic approximation. Specifically, for every $n$, $d = n^{o(1)}$, and every $d$-dimensional symmetric norm $\|\cdot\|$, there exists a data structure for $\mathrm{poly}(\log \log n)$-approximate nearest neighbor search over $\|\cdot\|$ for $n$-point datasets achieving $n^{o(1)}$ query time and $n^{1+o(1)}$ space. The main technical ingredient of the algorithm is a low-distortion embedding of a symmetric norm into a low-dimensional iterated product of top-$k$ norms. We also show that our techniques cannot be extended to general norms.
    Comment: 27 pages, 1 figure
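
    For concreteness, the top-$k$ norm that serves as the embedding's building block is simply the sum of the $k$ largest coordinates in absolute value; a minimal reference implementation (not part of the paper's data structure):

        # Top-k norm: the sum of the k largest coordinates in absolute value.
        # k = 1 recovers the l_inf norm and k = d recovers the l_1 norm.
        def top_k_norm(x, k):
            return sum(sorted((abs(v) for v in x), reverse=True)[:k])

        v = [3.0, -1.0, 4.0, 1.0, -5.0]
        print(top_k_norm(v, 1))       # 5.0  (l_inf)
        print(top_k_norm(v, 2))       # 9.0
        print(top_k_norm(v, len(v)))  # 14.0 (l_1)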

    Interval Selection in the Streaming Model

    Full text link
    A set of intervals is independent when the intervals are pairwise disjoint. In the interval selection problem we are given a set $\mathbb{I}$ of intervals and we want to find an independent subset of intervals of largest cardinality. Let $\alpha(\mathbb{I})$ denote the cardinality of an optimal solution. We discuss the estimation of $\alpha(\mathbb{I})$ in the streaming model, where we only have one-time, sequential access to the input intervals, the endpoints of the intervals lie in $\{1,\ldots,n\}$, and the amount of memory is constrained. For intervals of different sizes, we provide an algorithm in the data stream model that computes an estimate $\hat\alpha$ of $\alpha(\mathbb{I})$ that, with probability at least $2/3$, satisfies $\tfrac{1}{2}(1-\varepsilon)\alpha(\mathbb{I}) \le \hat\alpha \le \alpha(\mathbb{I})$. For same-length intervals, we provide another algorithm in the data stream model that computes an estimate $\hat\alpha$ of $\alpha(\mathbb{I})$ that, with probability at least $2/3$, satisfies $\tfrac{2}{3}(1-\varepsilon)\alpha(\mathbb{I}) \le \hat\alpha \le \alpha(\mathbb{I})$. The space used by our algorithms is bounded by a polynomial in $\varepsilon^{-1}$ and $\log n$. We also show that no better estimations can be achieved using $o(n)$ bits of storage. We also develop new, approximate solutions to the interval selection problem, where we want to report a feasible solution, that use $O(\alpha(\mathbb{I}))$ space. Our algorithms for the interval selection problem match the optimal results by Emek, Halldórsson and Rosén [Space-Constrained Interval Selection, ICALP 2012], but are much simpler.
    Comment: Minor corrections
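
    A far simpler baseline than the paper's algorithms is a one-pass greedy that keeps any interval disjoint from everything kept so far. The kept set is always independent, so it uses $O(\alpha(\mathbb{I}))$ space, and for same-length intervals a maximal disjoint set is a 2-approximation (each kept interval can intersect at most two intervals of an optimal solution); for arbitrary lengths it can be far from optimal. This sketch is for intuition only, not the paper's estimator.

        # One-pass greedy baseline (not the paper's algorithm): maintain a
        # maximal set of pairwise disjoint closed intervals.
        def stream_maximal_disjoint(stream):
            kept = []  # pairwise disjoint intervals seen so far
            for (a, b) in stream:
                if all(b < c or d < a for (c, d) in kept):
                    kept.append((a, b))
            return kept

        print(stream_maximal_disjoint([(1, 3), (2, 4), (5, 7), (6, 8), (9, 11)]))
        # [(1, 3), (5, 7), (9, 11)]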

    Deterministic Sampling and Range Counting in Geometric Data Streams

    Get PDF
    We present memory-efficient deterministic algorithms for constructing epsilon-nets and epsilon-approximations of streams of geometric data. Unlike probabilistic approaches, these deterministic samples provide guaranteed bounds on their approximation factors. We show how our deterministic samples can be used to answer approximate online iceberg geometric queries on data streams. We use these techniques to approximate several robust statistics of geometric data streams, including Tukey depth, simplicial depth, regression depth, the Theil-Sen estimator, and the least median of squares. Our algorithms use only a polylogarithmic amount of memory, provided the desired approximation factors are inverse-polylogarithmic. We also include a lower bound for non-iceberg geometric queries.
    Comment: 12 pages, 1 figure
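
    To recall what an epsilon-approximation guarantees, here is a deterministic construction in the simplest possible setting, one-dimensional points with interval ranges (a toy case, far simpler than the paper's streaming, multidimensional constructions): keeping every k-th point of the sorted input, each with weight k, miscounts any interval by at most roughly 2k/n, so k of about eps*n/2 yields an eps-approximation.

        # Deterministic eps-approximation in 1-D for interval ranges (a toy
        # setting, not the paper's streaming construction): keep every k-th
        # point of the sorted input, each standing for k original points.
        def one_d_eps_approximation(points, eps):
            pts = sorted(points)
            k = max(1, int(eps * len(pts) / 2))
            return pts[k - 1::k], k  # sample plus its per-point weight

        def estimate_fraction(sample, k, n, lo, hi):
            # Weighted fraction of points falling in [lo, hi].
            return sum(k for p in sample if lo <= p <= hi) / n

        points = list(range(1000))
        sample, k = one_d_eps_approximation(points, eps=0.1)
        print(len(sample))                                   # 20 points, not 1000
        print(estimate_fraction(sample, k, 1000, 100, 349))  # 0.25, the true fraction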

    Pseudorandomness for Regular Branching Programs via Fourier Analysis

    Full text link
    We present an explicit pseudorandom generator for oblivious, read-once, permutation branching programs of constant width that can read their input bits in any order. The seed length is $O(\log^2 n)$, where $n$ is the length of the branching program. The previous best seed length known for this model was $n^{1/2+o(1)}$, which follows as a special case of a generator due to Impagliazzo, Meka, and Zuckerman (FOCS 2012) (which gives a seed length of $s^{1/2+o(1)}$ for arbitrary branching programs of size $s$). Our techniques also give seed length $n^{1/2+o(1)}$ for general oblivious, read-once branching programs of width $2^{n^{o(1)}}$, which is incomparable to the results of Impagliazzo et al. Our pseudorandom generator is similar to the one used by Gopalan et al. (FOCS 2012) for read-once CNFs, but the analysis is quite different; ours is based on Fourier analysis of branching programs. In particular, we show that an oblivious, read-once, regular branching program of width $w$ has Fourier mass at most $(2w^2)^k$ at level $k$, independent of the length of the program.
    Comment: RANDOM 2013
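
    To make the bounded quantity concrete: the level-$k$ Fourier mass of a function $f\colon\{-1,1\}^n\to\{-1,1\}$ is $\sum_{|S|=k}|\hat f(S)|$, where $\hat f(S)=\mathbb{E}_x[f(x)\prod_{i\in S}x_i]$. A brute-force sketch for tiny scalar functions (the paper's analysis works with matrix-valued Fourier coefficients of branching programs, not this direct enumeration):

        # Brute-force level-k Fourier mass: sum over |S| = k of |f_hat(S)|,
        # where f_hat(S) = E_x[f(x) * prod_{i in S} x_i] over x in {-1,1}^n.
        import math
        from itertools import combinations, product

        def fourier_mass_at_level(f, n, k):
            mass = 0.0
            for S in combinations(range(n), k):
                coeff = sum(f(x) * math.prod(x[i] for i in S)
                            for x in product([-1, 1], repeat=n))
                mass += abs(coeff) / 2 ** n
            return mass

        parity = lambda x: x[0] * x[1] * x[2]
        and3 = lambda x: 1 if all(v == 1 for v in x) else -1
        print(fourier_mass_at_level(parity, 3, 3))  # 1.0 (all mass at level 3)
        print(fourier_mass_at_level(and3, 3, 1))    # 0.75 (three coefficients of 1/4)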

    Taylor Polynomial Estimator for Estimating Frequency Moments

    Full text link
    We present a randomized algorithm for estimating the $p$th moment $F_p$ of the frequency vector of a data stream in the general update (turnstile) model to within a multiplicative factor of $1 \pm \epsilon$, for $p > 2$, with high constant confidence. For $0 < \epsilon \le 1$, the algorithm uses space $O(n^{1-2/p} \epsilon^{-2} + n^{1-2/p} \epsilon^{-4/p} \log n)$ words. This improves over the current bound of $O(n^{1-2/p} \epsilon^{-2-4/p} \log n)$ words by Andoni et al. in \cite{ako:arxiv10}. Our space upper bound matches the lower bound of Li and Woodruff \cite{liwood:random13} for $\epsilon = (\log n)^{-\Omega(1)}$ and the lower bound of Andoni et al. \cite{anpw:icalp13} for $\epsilon = \Omega(1)$.
    Comment: Supersedes arXiv:1104.4552. Extended Abstract of this paper to appear in Proceedings of ICALP 2015
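
    For reference, the quantity being approximated: a turnstile stream is a sequence of (item, delta) updates to a frequency vector $f$, and $F_p = \sum_i |f_i|^p$. The exact linear-space computation below only pins down these semantics; the paper's contribution is estimating $F_p$ in sublinear space.

        # Exact (linear-space) F_p from a turnstile stream -- the reference
        # semantics, not the sublinear-space estimator of the paper.
        from collections import defaultdict

        def exact_Fp(updates, p):
            freq = defaultdict(int)
            for item, delta in updates:
                freq[item] += delta
            return sum(abs(f) ** p for f in freq.values())

        stream = [("a", 3), ("b", 1), ("a", -1), ("c", 2)]
        print(exact_Fp(stream, p=3))  # |2|^3 + |1|^3 + |2|^3 = 17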

    Distance-Sensitive Hashing

    Get PDF
    Locality-sensitive hashing (LSH) is an important tool for managing high-dimensional noisy or uncertain data, for example in connection with data cleaning (similarity join) and noise-robust search (similarity search). However, for a number of problems the LSH framework is not known to yield good solutions, and instead ad hoc solutions have been designed for particular similarity and distance measures. For example, this is true for output-sensitive similarity search/join, and for indexes supporting annulus queries that aim to report a point close to a certain given distance from the query point. In this paper we initiate the study of distance-sensitive hashing (DSH), a generalization of LSH that seeks a family of hash functions such that the probability of two points having the same hash value is a given function of the distance between them. More precisely, given a distance space $(X, \text{dist})$ and a "collision probability function" (CPF) $f\colon \mathbb{R}\rightarrow [0,1]$ we seek a distribution over pairs of functions $(h,g)$ such that for every pair of points $x, y \in X$ the collision probability is $\Pr[h(x)=g(y)] = f(\text{dist}(x,y))$. Locality-sensitive hashing is the study of how fast a CPF can decrease as the distance grows. For many spaces, $f$ can be made exponentially decreasing even if we restrict attention to the symmetric case where $g=h$. We show that the asymmetry achieved by having a pair of functions makes it possible to achieve CPFs that are, for example, increasing or unimodal, and show how this leads to principled solutions to problems not addressed by the LSH framework. This includes a novel application to privacy-preserving distance estimation. We believe that the DSH framework will find further applications in high-dimensional data management.
    Comment: Accepted at PODS'18. Abstract shortened due to character limit
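
    A concrete symmetric example of a CPF: bit-sampling LSH on Hamming space, where a hash that concatenates $k$ uniformly sampled coordinates satisfies $\Pr[h(x)=h(y)] = (1-d/n)^k$ exactly, a decreasing function of the distance $d$. The point of DSH is that asymmetric pairs $(h,g)$ admit much more general shapes (increasing, unimodal); the sketch below only checks the classical decreasing case empirically.

        # Bit-sampling LSH on {0,1}^n: the CPF is (1 - d/n)^k, a decreasing
        # function of Hamming distance d. (DSH generalizes the achievable shapes.)
        import random

        def sample_hash(n, k, rng):
            coords = [rng.randrange(n) for _ in range(k)]
            return lambda x: tuple(x[i] for i in coords)

        def empirical_cpf(x, y, n, k, trials=20000, seed=0):
            rng = random.Random(seed)
            hits = 0
            for _ in range(trials):
                h = sample_hash(n, k, rng)
                if h(x) == h(y):
                    hits += 1
            return hits / trials

        n, k = 32, 4
        x = [0] * n
        y = [1] * 8 + [0] * (n - 8)       # Hamming distance d = 8
        print(empirical_cpf(x, y, n, k))  # ~ (1 - 8/32)^4 = 0.316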

    Micronutrients in HIV: A Bayesian Meta-Analysis

    Get PDF
    Background: Approximately 28.5 million people living with HIV are eligible for treatment (CD4 < 500), but currently have no access to antiretroviral therapy. Reduced serum levels of micronutrients are common in HIV disease. Micronutrient supplementation (MNS) may mitigate disease progression and mortality. Objectives: We synthesized evidence on the effect of micronutrient supplementation on mortality and the rate of disease progression in HIV disease. Methods: We searched the MEDLINE, EMBASE, Cochrane Central, AMED and CINAHL databases through December 2014, without language restriction, for studies of more than 3 micronutrients versus any or no comparator. We built a hierarchical Bayesian random-effects model to synthesize results. Inferences are based on the posterior distribution of the population effects; posterior distributions were approximated by Markov chain Monte Carlo in OpenBUGS. Principal Findings: From 2166 initial references, we selected 49 studies for full review and identified eight reporting on disease progression and/or mortality. Bayesian synthesis of data from 2,249 adults in three studies estimated the relative risk of disease progression in subjects on MNS vs. control as 0.62 (95% credible interval: 0.37, 0.96). The median number needed to treat is 8.4 (4.8, 29.9) and the Bayes factor 53.4. Based on data on 4,095 adults reporting mortality in 7 randomized controlled studies, the RR was 0.84 (0.38, 1.85) and the NNT 25 (4.3, ∞). Conclusions: MNS significantly and substantially slows disease progression in HIV+ adults not on ARV, and possibly reduces mortality. Micronutrient supplements are effective in reducing progression, with a posterior probability of 97.9%. Considering the low cost of MNS and its lack of adverse effects, MNS should be standard of care for HIV+ adults not yet on ARV.
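
    As a hedged illustration of this kind of synthesis (a normal random-effects model on study-level log relative risks, fit by MCMC), here is a minimal Metropolis sampler. The study values are made-up placeholders, not the paper's data, and the paper fit its hierarchical model in OpenBUGS rather than with this sampler.

        # Minimal random-effects meta-analysis on log(RR) via Metropolis MCMC.
        # Hypothetical inputs; illustrative only.
        import math, random

        y  = [-0.51, -0.31, -0.62]   # hypothetical study log relative risks
        se = [0.25, 0.20, 0.30]      # hypothetical standard errors

        def log_post(mu, tau):
            if tau <= 0:
                return -math.inf
            lp = -tau ** 2 / 2           # weak half-normal prior on tau; flat on mu
            for yi, si in zip(y, se):
                v = si ** 2 + tau ** 2   # study-level effects marginalized out
                lp += -0.5 * math.log(v) - (yi - mu) ** 2 / (2 * v)
            return lp

        rng = random.Random(1)
        mu, tau, draws = 0.0, 0.2, []
        for it in range(20000):
            mu2, tau2 = mu + rng.gauss(0, 0.1), tau + rng.gauss(0, 0.1)
            if rng.random() < math.exp(min(0.0, log_post(mu2, tau2) - log_post(mu, tau))):
                mu, tau = mu2, tau2
            if it >= 5000:               # discard burn-in
                draws.append(mu)

        draws.sort()
        print("posterior median RR:", round(math.exp(draws[len(draws) // 2]), 3))
        print("95% CrI:", round(math.exp(draws[int(0.025 * len(draws))]), 3),
              round(math.exp(draws[int(0.975 * len(draws))]), 3))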

    Revisiting the Direct Sum Theorem and Space Lower Bounds in Random Order Streams

    Get PDF
    Estimating frequency moments and $L_p$ distances are well-studied problems in the adversarial data stream model, and tight space bounds are known for these two problems. There has been growing interest in revisiting these problems in the framework of random-order streams. The best space lower bound known for computing the $k^{th}$ frequency moment in random-order streams is $\Omega(n^{1-2.5/k})$ by Andoni et al., and it is conjectured that the true lower bound is $\Omega(n^{1-2/k})$. In this paper, we resolve this conjecture. In our approach, we revisit the direct sum theorem developed by Bar-Yossef et al. in a random-partition private messages model and provide a tight $\Omega(n^{1-2/k}/\ell)$ space lower bound for any $\ell$-pass algorithm that approximates the frequency moment in the random-order stream model to a constant factor. Finally, we also introduce the notion of space-entropy tradeoffs in random-order streams, as a means of studying intermediate models between adversarial and fully random-order streams. We show an almost tight space-entropy tradeoff for the $L_\infty$ distance and a non-trivial tradeoff for $L_p$ distances.

    The US Commitments to NATO in the Post-Cold War Period - A Case Study on Libya

    Get PDF
    The recent history of the US commitment to NATO has been dominated by economic pressures, squabbles over NATO’s military performance in Afghanistan, and the apparent American preference for ‘leading from behind’ in Libya. The case study on Libya is especially important in exploring the Obama administration’s understanding of the purpose of NATO in the context of current economic pressures, domestic US debates about post-War on Terror interventions, and increasing American preoccupation with Pacific (rather than European) security. In the case of Libya, the US initially hesitated to launch military operations against Libyan military targets. It appears to be the first time that the US followed, rather than led, its European allies into a campaign. Why the US was reluctant to intervene in Libya at the very beginning, why it later changed its mind and joined the operation, and why it transferred the Libyan mission to NATO and adopted the strategy of ‘leading from behind’ bear not only on the redefinition of the ‘American way of war’, but also on the future of NATO.