Search CORE

11,220 research outputs found

Fast evaluation of union-intersection expressions

Author: Bille Philip
Pagh Anna
Pagh Rasmus
Publication venue
Publication date: 01/01/2007
Field of study

We show how to represent sets in a linear space data structure such that expressions involving unions and intersections of sets can be computed in a worst-case efficient way. This problem has applications in e.g. information retrieval and database systems. We mainly consider the RAM model of computation, and sets of machine words, but also state our results in the I/O model. On a RAM with word size

w

, a special case of our result is that the intersection of

m

(preprocessed) sets, containing

n

elements in total, can be computed in expected time

O(n (\log w)^2 / w + km)

, where

k

is the number of elements in the intersection. If the first of the two terms dominates, this is a factor

w^{1-o(1)}

faster than the standard solution of merging sorted lists. We show a cell probe lower bound of time

\Omega(n/(w m \log m)+ (1-\tfrac{\log k}{w}) k)

, meaning that our upper bound is nearly optimal for small

m

. Our algorithm uses a novel combination of approximate set representations and word-level parallelism

arXiv.org e-Print Archive

CiteSeerX

The IT University of Copenhagen's Repository

QuickXsort: Efficient Sorting with n log n - 1.399n +o(n) Comparisons on Average

Author: A. Elmasry
C. Martínez
D. Cantone
D.R. Musser
I. Wegener
J. Ford
J. Katajainen
K. Reinhardt
M. Blum
R.D. Dutton
S. Edelkamp
V. Diekert
Publication venue
Publication date: 11/07/2013
Field of study

In this paper we generalize the idea of QuickHeapsort leading to the notion of QuickXsort. Given some external sorting algorithm X, QuickXsort yields an internal sorting algorithm if X satisfies certain natural conditions. With QuickWeakHeapsort and QuickMergesort we present two examples for the QuickXsort-construction. Both are efficient algorithms that incur approximately n log n - 1.26n +o(n) comparisons on the average. A worst case of n log n + O(n) comparisons can be achieved without significantly affecting the average case. Furthermore, we describe an implementation of MergeInsertion for small n. Taking MergeInsertion as a base case for QuickMergesort, we establish a worst-case efficient sorting algorithm calling for n log n - 1.3999n + o(n) comparisons on average. QuickMergesort with constant size base cases shows the best performance on practical inputs: when sorting integers it is slower by only 15% to STL-Introsort

arXiv.org e-Print Archive

Crossref