142 research outputs found

Architecture independent parallel selection with applications to parallel priority queues

Author: Gerbessiotis Alexandros V.
Siniolakis Constantinos J.
Publication venue: Elsevier Science B.V.
Publication date: 14/05/2003

We present a randomized selection algorithm whose performance is analyzed in an architecture independent way on the bulk-synchronous parallel (BSP) model of computation, along with an application of this algorithm to dynamic data structures, namely parallel priority queues. We show that our algorithms improve upon previous results in both the communication requirements and the amount of parallel slack required to achieve optimal performance. We also establish that optimality to within small multiplicative constant factors can be achieved for a wide range of parallel machines. While these algorithms are fairly simple themselves, descriptions of their performance in terms of the BSP parameters are somewhat involved; the main reward of quantifying these complications is that it allows transportable software to be written for parallel machines that fit the model.

Elsevier - Publisher Connector
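
As a rough illustration of the selection problem that the abstract above addresses, the following is a sequential randomized selection sketch in Python. It is not the BSP algorithm of the paper, whose details are not given in this listing; the name `randomized_select` and its parameters are hypothetical.

```python
import random


def randomized_select(items, k, rng=None):
    """Return the k-th smallest element (0-based) of a non-empty sequence.

    Sequential sketch only: a random pivot splits the candidates into
    smaller / equal / larger groups, and the search continues in the
    group that must contain rank k.
    """
    if not 0 <= k < len(items):
        raise IndexError("k out of range")
    if rng is None:
        rng = random.Random()
    data = list(items)
    while True:
        pivot = rng.choice(data)
        smaller = [x for x in data if x < pivot]
        n_equal = sum(1 for x in data if x == pivot)
        if k < len(smaller):
            data = smaller
        elif k < len(smaller) + n_equal:
            return pivot
        else:
            # Discard everything up to and including the pivot group.
            k -= len(smaller) + n_equal
            data = [x for x in data if x > pivot]
```

Calling `randomized_select(values, len(values) // 2)` returns the upper median of `values`. A parallel version would additionally have to account for how the candidates are distributed and communicated between processors, which this sketch ignores.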

Fast Sorting on a Distributed-Memory Architecture

Author: Cheng David R.
Edelman Alan
Gilbert John R.
Shah Viral
Publication date: 01/01/2005

We consider the often-studied problem of sorting, for a parallel computer. Given an input array distributed evenly over p processors, the task is to compute the sorted output array, also distributed over the p processors. Many existing algorithms take the approach of approximately load-balancing the output, leaving each processor with Θ(n/p) elements. However, in many cases, approximate load-balancing leads to inefficiencies in both the sorting itself and in further uses of the data after sorting. We provide a deterministic parallel sorting algorithm that uses parallel selection to produce any output distribution exactly, particularly one that is perfectly load-balanced. Furthermore, when using a comparison sort, this algorithm is 1-optimal in both computation and communication. We provide an empirical study that illustrates the efficiency of exact data splitting, and shows an improvement over two sample sort algorithms. Singapore-MIT Alliance (SMA)

DSpace@MIT
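
The exact-splitting idea described in the abstract above can be sketched as a single-process simulation in Python. This is a minimal sketch under stated assumptions, not the authors' algorithm: a global sort stands in for the parallel selection step, and the names `exact_split_sort`, `local_arrays`, and `target_sizes` are hypothetical.

```python
from bisect import bisect_right


def exact_split_sort(local_arrays, target_sizes=None):
    """Simulate sorting data held by p 'processors' with exact output sizes.

    local_arrays: one list of comparable values per simulated processor.
    target_sizes: desired output size per processor (defaults to a
    perfectly balanced distribution).
    """
    p = len(local_arrays)
    n = sum(len(a) for a in local_arrays)
    if target_sizes is None:
        base, extra = divmod(n, p)
        target_sizes = [base + (1 if i < extra else 0) for i in range(p)]
    if len(target_sizes) != p or sum(target_sizes) != n:
        raise ValueError("target sizes must cover all n elements")

    # Tag each value with (source processor, local index) so keys are
    # unique; this lets duplicates be split exactly at a boundary.
    keyed = [(v, src, idx)
             for src, arr in enumerate(local_arrays)
             for idx, v in enumerate(arr)]

    # ASSUMPTION: a global sort stands in for parallel selection, which
    # would find the element at each boundary rank without sorting.
    ordered = sorted(keyed)
    splitters, rank = [], 0
    for size in target_sizes[:-1]:
        rank += size
        if rank < n:
            splitters.append(ordered[rank])

    # Each processor routes its elements to the destination whose rank
    # range contains them, then every destination sorts locally.
    outputs = [[] for _ in range(p)]
    for key in keyed:
        outputs[bisect_right(splitters, key)].append(key)
    return [[v for v, _, _ in sorted(bucket)] for bucket in outputs]
```

With three simulated processors holding `[9, 3, 7]`, `[1, 8, 2]`, and `[6, 5, 4, 0]`, the default call yields output sizes 4, 3, and 3, and passing `target_sizes=[1, 1, 8]` yields exactly those sizes instead. The tie-breaking tags are one possible way to make the split exact when values repeat.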

Robust massively parallel sorting

Author: Axtmann Michael
Sanders Peter
Publication venue: SIAM Publ.
Publication date: 01/01/2017

Crossref

KITopen

Practical Massively Parallel Sorting

Author: Axtmann Michael
Bingmann Timo
Sanders Peter
Schulz Christian
Publication venue: Association for Computing Machinery
Publication date: 01/01/2015

KITopen

Space-Round Tradeoffs for MapReduce Computations

Author: Pietracaprina Andrea
Pucci Geppino
Riondato Matteo
Silvestri Francesco
Upfal Eli
Publication venue: Association for Computing Machinery (ACM)
Publication date: 09/11/2011

This work explores fundamental modeling and algorithmic issues arising in the well-established MapReduce framework. First, we formally specify a computational model for MapReduce which captures the functional flavor of the paradigm by allowing for a flexible use of parallelism. Indeed, the model diverges from a traditional processor-centric view by featuring parameters which embody only global and local memory constraints, thus favoring a more data-centric view. Second, we apply the model to the fundamental computation task of matrix multiplication, presenting upper and lower bounds for both dense and sparse matrix multiplication, which highlight interesting tradeoffs between space and round complexity. Finally, building on the matrix multiplication results, we derive further space-round tradeoffs on matrix inversion and matching.

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Università di Padova