Search CORE

42 research outputs found

Optimal parallel string algorithms: sorting, merching and computing the minimum

Author: Hagerup T.
Publication venue: Max-Planck-Institut für Informatik
Publication date: 01/01/1993
Field of study

We study fundamental comparison problems on strings of characters, equipped with the usual lexicographical ordering. For each problem studied, we give a parallel algorithm that is optimal with respect to at least one criterion for which no optimal algorithm was previously known. Specifically, our main results are: % \begin{itemize} \item Two sorted sequences of strings, containing altogether

n

~characters, can be merged in

O(\log n)

time using

O(n)

operations on an EREW PRAM. This is optimal as regards both the running time and the number of operations. \item A sequence of strings, containing altogether

n

~characters represented by integers of size polynomial in~

n

, can be sorted in

O({{\log n}/{\log\log n}})

time using

O(n\log\log n)

operations on a CRCW PRAM. The running time is optimal for any polynomial number of processors. \item The minimum string in a sequence of strings containing altogether

n

characters can be found using (expected)

O(n)

operations in constant expected time on a randomized CRCW PRAM, in

O(\log\log n)

time on a deterministic CRCW PRAM with a program depending on~

n

, in

O((\log\log n)^3)

time on a deterministic CRCW PRAM with a program not depending on~

n

, in

O(\log n)

expected time on a randomized EREW PRAM, and in

O(\log n\log\log n)

time on a deterministic EREW PRAM. The number of operations is optimal, and the running time is optimal for the randomized algorithms and, if the number of processors is limited to~

n

, for the nonuniform deterministic CRCW PRAM algorithm as we

Deterministic Computations on a PRAM with Static Processor and Memory Faults.

Author: Chlebus Bogdan S
Gasieniec Leszek
Pelc Andrzej
Publication venue
Publication date: 01/01/2018
Field of study

We consider Parallel Random Access Machine (PRAM) which has some processors and memory cells faulty. The faults considered are static, i.e., once the machine starts to operate, the operational/faulty status of PRAM components does not change. We develop a deterministic simulation of a fully operational PRAM on a similar faulty machine which has constant fractions of faults among processors and memory cells. The simulating PRAM has

n

processors and

m

memory cells, and simulates a PRAM with

n

processors and a constant fraction of

m

memory cells. The simulation is in two phases: it starts with preprocessing, which is followed by the simulation proper performed in a step-by-step fashion. Preprocessing is performed in time

O((\frac{m}{n}+ \log n)\log n)

. The slowdown of a step-by-step part of the simulation is

O(\log m)

arXiv.org e-Print Archive

University of Liverpool Repository

Fast Parallel Algorithms for Basic Problems

Author: Wen Zhaofang
Publication venue: ODU Digital Commons
Publication date: 01/07/1991
Field of study

Parallel processing is one of the most active research areas these days. We are interested in one aspect of parallel processing, i.e. the design and analysis of parallel algorithms. Here, we focus on non-numerical parallel algorithms for basic combinatorial problems, such as data structures, selection, searching, merging and sorting. The purposes of studying these types of problems are to obtain basic building blocks which will be useful in solving complex problems, and to develop fundamental algorithmic techniques. In this thesis, we study the following problems: priority queues, multiple search and multiple selection, and reconstruction of a binary tree from its traversals. The research on priority queue was motivated by its various applications. The purpose of studying multiple search and multiple selection is to explore the relationships between four of the most fundamental problems in algorithm design, that is, selection, searching, merging and sorting; while our parallel solutions can be used as subroutines in algorithms for other problems. The research on the last problem, reconstruction of a binary tree from its traversals, was stimulated by a challenge proposed in a recent paper by Berkman et al. ( Highly Parallelizable Problems, STOC 89) to design doubly logarithmic time optimal parallel algorithms because a remarkably small number of such parallel algorithms exist

PRAMおよび対数時間一様な論理回路族に基づく計算量の階層(計算モデルと計算の複雑さに関する研究)

Author: Iwama Kazuo
Iwamoto Chuzo
Publication venue: 京都大学数理解析研究所
Publication date: 01/05/1996
Field of study

A Practical Hierarchial Model of Parallel Computation: The Model

Author: Heywood Todd
Ranka Sanjay
Publication venue: SURFACE at Syracuse University
Publication date: 01/02/1991
Field of study

We introduce a model of parallel computation that retains the ideal properties of the PRAM by using it as a sub-model, while simultaneously being more reflective of realistic parallel architectures by accounting for and providing abstract control over communication and synchronization costs. The Hierarchical PRAM (H-PRAM) model controls conceptual complexity in the face of asynchrony in two ways. First, by providing the simplifying assumption of synchronization to the design of algorithms, but allowing the algorithms to work asynchronously with each other; and organizing this control asynchrony via an implicit hierarchy relation. Second, by allowing the restriction of communication asynchrony in order to obtain determinate algorithms (thus greatly simplifying proofs of correctness). It is shown that the model is reflective of a variety of existing and proposed parallel architectures, particularly ones that can support massive parallelism. Relationships to programming languages are discussed. Since the PRAM is a sub-model, we can use PRAM algorithms as sub-algorithms in algorithms for the H-PRAM; thus results that have been established with respect to the PRAM are potentially transferable to this new model. The H-PRAM can be used as a flexible tool to investigate general degrees of locality (“neighborhoods of activity) in problems, considering communication and synchronization simultaneously. This gives the potential of obtaining algorithms that map more efficiently to architectures, and of increasing the number of processors that can efficiently be used on a problem (in comparison to a PRAM that charges for communication and synchronization). The model presents a framework in which to study the extent that general locality can be exploited in parallel computing. A companion paper demonstrates the usage of the H-PRAM via the design and analysis of various algorithms for computing the complete binary tree and the FFT/butterfly graph

Syracuse University Research Facility and Collaborative Environment