Search CORE

5,946 research outputs found

Improved parallel integer sorting without concurrent writing

Author: Albers S.
Hagerup T.
Publication venue: Max-Planck-Institut für Informatik
Publication date: 01/01/1994
Field of study

We show that

n

integers in the range 1 \twodots n can be stably sorted on an \linebreak EREW PRAM using \nolinebreak

O(t)

time \linebreak and

O(n(\sqrt{\log n\log\log n}+{{(\log n)^2}/t}))

operations, for arbitrary given \linebreak

t\ge\log n\log\log n

, and on a CREW PRAM using %

O(\log n\log\log n)

time and

O(n\sqrt{\log n})

O(t)

time and

O(n(\sqrt{\log n}+{{\log n}/{2^{{t/{\log n}}}}}))

operations, for arbitrary given

t\ge\log n

. In addition, we are able to sort

n

arbitrary integers on a randomized CREW PRAM % using %

O(\log n\log\log n)

time and

O(n\sqrt{\log n})

operations within the same resource bounds with high probability. In each case our algorithm is a factor of almost

\Theta(\sqrt{\log n})

closer to optimality than all previous algorithms for the stated problem in the stated model, and our third result matches the operation count of the best known sequential algorithm. We also show that

n

integers in the range 1 \twodots m can be sorted in

O((\log n)^2)

time with

O(n)

operations on an EREW PRAM using a nonstandard word length of

O(\log n \log\log n \log m)

bits, thereby greatly improving the upper bound on the word length necessary to sort integers with a linear time-processor product, even sequentially. Our algorithms were inspired by, and in one case directly use, the fusion trees of Fredman and Willard

CiteSeerX

MPG.PuRe

A Bulk-Parallel Priority Queue in External Memory with STXXL

Author: GS Brodal
J Singler
JS Vitter
L Arge
MC Pinotti
N Deo
P Sanders
P Sanders
PJ Varman
R Dementiev
Publication venue
Publication date: 01/01/2015
Field of study

We propose the design and an implementation of a bulk-parallel external memory priority queue to take advantage of both shared-memory parallelism and high external memory transfer speeds to parallel disks. To achieve higher performance by decoupling item insertions and extractions, we offer two parallelization interfaces: one using "bulk" sequences, the other by defining "limit" items. In the design, we discuss how to parallelize insertions using multiple heaps, and how to calculate a dynamic prediction sequence to prefetch blocks and apply parallel multiway merge for extraction. Our experimental results show that in the selected benchmarks the priority queue reaches 75% of the full parallel I/O bandwidth of rotational disks and and 65% of SSDs, or the speed of sorting in external memory when bounded by computation.Comment: extended version of SEA'15 conference pape

arXiv.org e-Print Archive

Crossref

KITopen

Parallel Merging and Sorting on Linked List

Author: Han Yijie
Tata Sreevalli
Publication venue: 'Asian Online Journals'
Publication date: 30/03/2021
Field of study

We study linked list sorting and merging on the PRAM model. In this paper we show that n real numbers can be sorted into a linked list in constant time with n2+e processors or in ) time with n2 processors. We also show that two sorted linked lists of n integers in {0, 1, …, m} can be merged into one sorted linked list in O(log(c)n(loglogm)1/2) time using n/(log(c)n(loglogm)1/2) processors, where c is an arbitrarily large constant

International Journal of Computer and Information Technology

Parallel Wavelet Tree Construction

Author: Shun Julian
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/04/2015
Field of study

We present parallel algorithms for wavelet tree construction with polylogarithmic depth, improving upon the linear depth of the recent parallel algorithms by Fuentes-Sepulveda et al. We experimentally show on a 40-core machine with two-way hyper-threading that we outperform the existing parallel algorithms by 1.3--5.6x and achieve up to 27x speedup over the sequential algorithm on a variety of real-world and artificial inputs. Our algorithms show good scalability with increasing thread count, input size and alphabet size. We also discuss extensions to variants of the standard wavelet tree.Comment: This is a longer version of the paper that appears in the Proceedings of the IEEE Data Compression Conference, 201

arXiv.org e-Print Archive

Crossref