Search CORE

147 research outputs found

Pseudorandom Hashing for Space-bounded Computation with Applications in Streaming

Author: Kacham Praneeth
Pagh Rasmus
Thorup Mikkel
Woodruff David P.
Publication venue
Publication date: 04/01/2024
Field of study

We revisit Nisan's classical pseudorandom generator (PRG) for space-bounded computation (STOC 1990) and its applications in streaming algorithms. We describe a new generator, HashPRG, that can be thought of as a symmetric version of Nisan's generator over larger alphabets. Our generator allows a trade-off between seed length and the time needed to compute a given block of the generator's output. HashPRG can be used to obtain derandomizations with much better update time and \emph{without sacrificing space} for a large number of data stream algorithms, such as

F_p

estimation in the parameter regimes

p > 2

and

0 < p < 2

and CountSketch with tight estimation guarantees as analyzed by Minton and Price (SODA 2014) which assumed access to a random oracle. We also show a recent analysis of Private CountSketch can be derandomized using our techniques. For a

d

-dimensional vector

x

being updated in a turnstile stream, we show that

\|x\|_{\infty}

can be estimated up to an additive error of

\varepsilon\|x\|_{2}

using

O(\varepsilon^{-2}\log(1/\varepsilon)\log d)

bits of space. Additionally, the update time of this algorithm is

O(\log 1/\varepsilon)

in the Word RAM model. We show that the space complexity of this algorithm is optimal up to constant factors. However, for vectors

x

with

\|x\|_{\infty} = \Theta(\|x\|_{2})

, we show that the lower bound can be broken by giving an algorithm that uses

O(\varepsilon^{-2}\log d)

bits of space which approximates

\|x\|_{\infty}

up to an additive error of

\varepsilon\|x\|_{2}

. We use our aforementioned derandomization of the CountSketch data structure to obtain this algorithm, and using the time-space trade off of HashPRG, we show that the update time of this algorithm is also

O(\log 1/\varepsilon)

in the Word RAM model.Comment: Minor writing improvement

arXiv.org e-Print Archive

Efficient Dynamic Approximate Distance Oracles for Vertex-Labeled Planar Graphs

Author: BT Wilkinson
DE Willard
M Li
M Thorup
ML Fredman
MR Henzinger
Q-P Gu
R Pagh
RJ Lipton
S Mozes
Publication venue
Publication date: 27/08/2017
Field of study

Let

G

be a graph where each vertex is associated with a label. A Vertex-Labeled Approximate Distance Oracle is a data structure that, given a vertex

v

and a label

\lambda

, returns a

(1+\varepsilon)

-approximation of the distance from

v

to the closest vertex with label

\lambda

G

. Such an oracle is dynamic if it also supports label changes. In this paper we present three different dynamic approximate vertex-labeled distance oracles for planar graphs, all with polylogarithmic query and update times, and nearly linear space requirements

arXiv.org e-Print Archive

Crossref

On the k-Independence Required by Linear Probing and Minwise Independence

Author: A. Pagh
A.Z. Broder
A.Z. Broder
E. Cohen
J.P. Schmidt
M.N. Wegman
P. Indyk
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Crossref

Optical and electrical properties of polycrystalline and amorphous Al-Ti thin films

Author: C. N. Borca
Deng W.
J. Schou
K. Pagh Almtoft
K. Rechendorff
L. P. Nielsen
S. Canulescu
S. Daviðsdóttir
Publication venue: 'AIP Publishing'
Publication date: 01/01/2016
Field of study

Crossref

Online Research Database In Technology

Efficiently Correcting Matrix Products

Author: A De Bonis
A Lingas
A Schönhage
Andrzej Lingas
Christos Levcopoulos
D Coppersmith
DG Cantor
DZ Du
J Naor
JL Carter
Leszek Gąsieniec
MA Iwen
P Wu
R Pagh
Rasmus Pagh
RM McConnell
T Kimbrel
Takeshi Tokuyama
V Strassen
X Huang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

We study the problem of efficiently correcting an erroneous product of two

n\times n

matrices over a ring. Among other things, we provide a randomized algorithm for correcting a matrix product with at most

k

erroneous entries running in

\tilde{O}(n^2+kn)

time and a deterministic

\tilde{O}(kn^2)

-time algorithm for this problem (where the notation

\tilde{O}

suppresses polylogarithmic terms in

n

and

k

).Comment: Fixed invalid reference to figure in v

arXiv.org e-Print Archive

University of Liverpool Repository

Lund University Publications

Crossref

Springer - Publisher Connector

The IT University of Copenhagen's Repository

Wear Minimization for Cuckoo Hashing: How Not to Throw a Lot of Eggs into One Basket

Author: A. Ben-Aroya
A. Kirsch
A.M. Frieze
A.M. Frieze
D. Fotakis
E. Lehman
H.-S.P. Wong
J. Schmidt-Pruzan
L. Devroye
M. Dietzfelbinger
M. Karoński
P. Pavan
R. Bez
R. Pagh
S. Irani
Y. Arbitman
Y. Azar
Y.-H. Chang
Publication venue
Publication date: 01/01/2014
Field of study

We study wear-leveling techniques for cuckoo hashing, showing that it is possible to achieve a memory wear bound of

\log\log n+O(1)

after the insertion of

n

items into a table of size

Cn

for a suitable constant

C

using cuckoo hashing. Moreover, we study our cuckoo hashing method empirically, showing that it significantly improves on the memory wear performance for classic cuckoo hashing and linear probing in practice.Comment: 13 pages, 1 table, 7 figures; to appear at the 13th Symposium on Experimental Algorithms (SEA 2014

arXiv.org e-Print Archive

Crossref

Dynamic Compressed Strings with Random Access

Author: A. Brodnik
G. Manzini
J. Barbay
J. Jansson
M. Dietzfelbinger
M. Pǎtraşcu
P. Ferragina
P. Ferragina
R. González
R. Grossi
R. Grossi
R. Pagh
R. Pagh
R. Raman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

We consider the problem of storing a string S in dynamic compressed form, while permitting operations directly on the compressed representation of S: access a substring of S; replace, insert or delete a symbol in S; count how many occurrences of a given symbol appear in any given prefix of S (called rank operation) and locate the position of the ith occurrence of a symbol inside S (called select operation). We discuss the time complexity of several combinations of these operations along with the entropy space bounds of the corresponding compressed indexes. In this way, we extend or improve the bounds of previous work by Ferragina and Venturini [TCS, 2007], Jansson et al. [ICALP, 2012], and Nekrich and Navarro [SODA, 2013]

Crossref

Archivio della Ricerca - Università di Pisa

Leicester Research Archive