Search CORE

108 research outputs found

Dynamic Ordered Sets with Exponential Search Trees

Author: Andersson Arne
Thorup Mikkel
Publication venue
Publication date: 01/01/2002
Field of study

We introduce exponential search trees as a novel technique for converting static polynomial space search structures for ordered sets into fully-dynamic linear space data structures. This leads to an optimal bound of O(sqrt(log n/loglog n)) for searching and updating a dynamic set of n integer keys in linear space. Here searching an integer y means finding the maximum key in the set which is smaller than or equal to y. This problem is equivalent to the standard text book problem of maintaining an ordered set (see, e.g., Cormen, Leiserson, Rivest, and Stein: Introduction to Algorithms, 2nd ed., MIT Press, 2001). The best previous deterministic linear space bound was O(log n/loglog n) due Fredman and Willard from STOC 1990. No better deterministic search bound was known using polynomial space. We also get the following worst-case linear space trade-offs between the number n, the word length w, and the maximal key U < 2^w: O(min{loglog n+log n/log w, (loglog n)(loglog U)/(logloglog U)}). These trade-offs are, however, not likely to be optimal. Our results are generalized to finger searching and string searching, providing optimal results for both in terms of n.Comment: Revision corrects some typoes and state things better for applications in subsequent paper

arXiv.org e-Print Archive

CiteSeerX

Round-Hashing for Data Storage: Distributed Servers and External-Memory Tables

Author: Grossi Roberto
Versari Luca
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 26th Annual European Symposium on Algorithms (ESA 2018)
Publication date: 01/01/2018
Field of study

This paper proposes round-hashing, which is suitable for data storage on distributed servers and for implementing external-memory tables in which each lookup retrieves at most one single block of external memory, using a stash. For data storage, round-hashing is like consistent hashing as it avoids a full rehashing of the keys when new servers are added. Experiments show that the speed to serve requests is tenfold or more than the state of the art. In distributed data storage, this guarantees better throughput for serving requests and, moreover, greatly reduces decision times for which data should move to new servers as rescanning data is much faster

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Archivio della Ricerca - Università di Pisa

Dagstuhl Research Online Publication Server

Faster deterministic sorting and priority queues in linear space

Author: Thorup M.
Publication venue: Max-Planck-Institut für Informatik
Publication date: 01/01/1997
Field of study

The RAM complexity of deterministic linear space sorting of integers in words is improved from

O(n\sqrt{\log n})

O(n(\log\log n)^2)

. No better bounds are known for polynomial space. In fact, the techniques give a deterministic linear space priority queue supporting insert and delete in

O((\log\log n)^2)

amortized time and find-min in constant time. The priority queue can be implemented using addition, shift, and bit-wise boolean operations

Copenhagen University Research Information System

MPG.PuRe

Answering Spatial Multiple-Set Intersection Queries Using 2-3 Cuckoo Hash-Filters

Author: Eppstein David
Hagerup Torben
Kopelowitz Tsvi
Yang Xiao
Publication venue
Publication date: 29/08/2017
Field of study

We show how to answer spatial multiple-set intersection queries in O(n(log w)/w + kt) expected time, where n is the total size of the t sets involved in the query, w is the number of bits in a memory word, k is the output size, and c is any fixed constant. This improves the asymptotic performance over previous solutions and is based on an interesting data structure, known as 2-3 cuckoo hash-filters. Our results apply in the word-RAM model (or practical RAM model), which allows for constant-time bit-parallel operations, such as bitwise AND, OR, NOT, and MSB (most-significant 1-bit), as exist in modern CPUs and GPUs. Our solutions apply to any multiple-set intersection queries in spatial data sets that can be reduced to one-dimensional range queries, such as spatial join queries for one-dimensional points or sets of points stored along space-filling curves, which are used in GIS applications.Comment: Full version of paper from 2017 ACM SIGSPATIAL International Conference on Advances in Geographic Information System

arXiv.org e-Print Archive

Crossref

Planar Reachability in Linear Space and Constant Time

Author: Holm Jacob
Rotenberg Eva
Thorup Mikkel
Publication venue
Publication date: 01/01/2015
Field of study

We show how to represent a planar digraph in linear space so that distance queries can be answered in constant time. The data structure can be constructed in linear time. This representation of reachability is thus optimal in both time and space, and has optimal construction time. The previous best solution used

O(n\log n)

space for constant query time [Thorup FOCS'01].Comment: 20 pages, 5 figures, submitted to FoC

arXiv.org e-Print Archive

CiteSeerX

Crossref

Copenhagen University Research Information System

Online Research Database In Technology

Static Data Structure Lower Bounds Imply Rigidity

Author: Agarwal Pankaj K.
Deterministic
Higher
Lower
Lower
Miltersen Peter Bro
Noisy
Valiant Leslie G.
van Emde Boas Peter
şcu Mihai P
Publication venue
Publication date: 13/02/2019
Field of study

We show that static data structure lower bounds in the group (linear) model imply semi-explicit lower bounds on matrix rigidity. In particular, we prove that an explicit lower bound of

t \geq \omega(\log^2 n)

on the cell-probe complexity of linear data structures in the group model, even against arbitrarily small linear space

(s= (1+\varepsilon)n)

, would already imply a semi-explicit (

\bf P^{NP}\rm

) construction of rigid matrices with significantly better parameters than the current state of art (Alon, Panigrahy and Yekhanin, 2009). Our results further assert that polynomial (

t\geq n^{\delta}

) data structure lower bounds against near-optimal space, would imply super-linear circuit lower bounds for log-depth linear circuits (a four-decade open question). In the succinct space regime

(s=n+o(n))

, we show that any improvement on current cell-probe lower bounds in the linear model would also imply new rigidity bounds. Our results rely on a new connection between the "inner" and "outer" dimensions of a matrix (Paturi and Pudlak, 2006), and on a new reduction from worst-case to average-case rigidity, which is of independent interest

arXiv.org e-Print Archive

Princeton University Open Access Repository

Crossref

Linear Hashing

Author: Alon Noga
Dietzfelbinger Martin
Miltersen Peter Bro
Petrank Erez
Tardos Gábor
Publication venue: 'Aarhus University Library'
Publication date: 16/01/1997
Field of study

Consider the set H of all linear (or affine) transformations between two vector spaces over a finite field F. We study how good H is as a class of hash functions, namely we consider hashing a set S of sizen into a range having the same cardinality n by a randomly chosen function from H and look at the expected size of the largest hash bucket. H is a universal class of hash functions for any finite field, butwith respect to our measure different fields behave differently

Tidsskrift.dk (Det Kongelige Bibliotek)

Integer priority queues with decrease key in constant time and the single source shortest paths problem

Author: Thorup Mikkel
Publication venue: Elsevier Inc. Published by Elsevier Inc.
Publication date: 30/11/2004
Field of study

AbstractWe consider Fibonacci heap style integer priority queues supporting find-min, insert, and decrease key operations in constant time. We present a deterministic linear space solution that with n integer keys supports delete in O(loglogn) time. If the integers are in the range [0,N), we can also support delete in O(loglogN) time.Even for the special case of monotone priority queues, where the minimum has to be non-decreasing, the best previous bounds on delete were O((logn)1/(3−ε)) and O((logN)1/(4−ε)). These previous bounds used both randomization and amortization. Our new bounds are deterministic, worst-case, with no restriction to monotonicity, and exponentially faster.As a classical application, for a directed graph with n nodes and m edges with non-negative integer weights, we get single source shortest paths in O(m+nloglogn) time, or O(m+nloglogC) if C is the maximal edge weight. The latter solves an open problem of Ahuja, Mehlhorn, Orlin, and Tarjan from 1990

Elsevier - Publisher Connector