Search CORE

1,585 research outputs found

Optimal cache-aware suffix selection

Author: Franceschini Gianni
Grossi Roberto
Muthukrishnan S.
Publication venue
Publication date: 01/01/2009
Field of study

Given string

S[1..N]

and integer

k

, the {\em suffix selection} problem is to determine the

k

th lexicographically smallest amongst the suffixes

S[i... N]

1 \leq i \leq N

. We study the suffix selection problem in the cache-aware model that captures two-level memory inherent in computing systems, for a \emph{cache} of limited size

M

and block size

B

. The complexity of interest is the number of block transfers. We present an optimal suffix selection algorithm in the cache-aware model, requiring \Thetah{N/B} block transfers, for any string

S

over an unbounded alphabet (where characters can only be compared), under the common tall-cache assumption (i.e. M=\Omegah{B^{1+\epsilon}}, where

\epsilon<1

). Our algorithm beats the bottleneck bound for permuting an input array to the desired output array, which holds for nearly any nontrivial problem in hierarchical memory models

arXiv.org e-Print Archive

Archivio della Ricerca - Università di Pisa

Dagstuhl Research Online Publication Server

Archivio della ricerca- Università di Roma La Sapienza

Hal-Diderot

Proxy Caching for Video-on-Demand Using Flexible Starting Point Selection

Author: Li Xiaoling
Muhammad Muhammad
Steinbach Eckehard
Tu Wei
Publication venue: IEEE - Institute of Electrical and Electronics Engineers
Publication date: 01/01/2009
Field of study

Institute of Transport Research:Publications

c-trie++: A Dynamic Trie Tailored for Fast Prefix Searches

Author: Bannai Hideo
Inenaga Shunsuke
Kanda Shunsuke
Köppl Dominik
Nakashima Yuto
Takeda Masayuki
Tsuruta Kazuya
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 07/10/2020
Field of study

Given a dynamic set

K

k

strings of total length

n

whose characters are drawn from an alphabet of size

\sigma

, a keyword dictionary is a data structure built on

K

that provides locate, prefix search, and update operations on

K

. Under the assumption that

\alpha = w / \lg \sigma

characters fit into a single machine word

w

, we propose a keyword dictionary that represents

K

n \lg \sigma + \Theta(k \lg n)

bits of space, supporting all operations in

O(m / \alpha + \lg \alpha)

expected time on an input string of length

m

in the word RAM model. This data structure is underlined with an exhaustive practical evaluation, highlighting the practical usefulness of the proposed data structure, especially for prefix searches - one of the most elementary keyword dictionary operations

arXiv.org e-Print Archive

Crossref

Engineering Parallel String Sorting

Author: Bingmann Timo
Eberle Andreas
Sanders Peter
Publication venue
Publication date: 09/03/2014
Field of study

We discuss how string sorting algorithms can be parallelized on modern multi-core shared memory machines. As a synthesis of the best sequential string sorting algorithms and successful parallel sorting algorithms for atomic objects, we first propose string sample sort. The algorithm makes effective use of the memory hierarchy, uses additional word level parallelism, and largely avoids branch mispredictions. Then we focus on NUMA architectures, and develop parallel multiway LCP-merge and -mergesort to reduce the number of random memory accesses to remote nodes. Additionally, we parallelize variants of multikey quicksort and radix sort that are also useful in certain situations. Comprehensive experiments on five current multi-core platforms are then reported and discussed. The experiments show that our implementations scale very well on real-world inputs and modern machines.Comment: 46 pages, extension of "Parallel String Sample Sort" arXiv:1305.115

arXiv.org e-Print Archive

CiteSeerX

KITopen

Scalable String and Suffix Sorting: Algorithms, Techniques, and Tools

Author: Bingmann Timo
Publication venue
Publication date: 01/01/2018
Field of study

This dissertation focuses on two fundamental sorting problems: string sorting and suffix sorting. The first part considers parallel string sorting on shared-memory multi-core machines, the second part external memory suffix sorting using the induced sorting principle, and the third part distributed external memory suffix sorting with a new distributed algorithmic big data framework named Thrill.Comment: 396 pages, dissertation, Karlsruher Instituts f\"ur Technologie (2018). arXiv admin note: text overlap with arXiv:1101.3448 by other author

arXiv.org e-Print Archive

KITopen

Parallel String Sample Sort

Author: J. Kärkkäinen
J. Wassenberg
K. Mehlhorn
P. Sanders
P.M. McIlroy
R. Sinha
R. Sinha
R. Sinha
T. Hagerup
W. Ng
Publication venue
Publication date: 01/01/2013
Field of study

arXiv.org e-Print Archive

CiteSeerX

Crossref

KITopen