Search CORE

6 research outputs found

List Ranking on a Coarse Grained Multiprocessor

Author: Gustedt Jens
Guérin Lassous Isabelle
Publication venue: HAL CCSD
Publication date: 01/01/1999
Field of study

We present a deterministic algorithm for the List Ranking Problem on a Coarse Grained p-Multiprocessor (CGM) that is only a factor of log*(p) away from optimality. This statement holds as well for counting communication rounds where it achieves O(log(p) log*(p)) and for the required communication cost and total computation time where it achieves O(n log*(p)). We report on experimental studies of that algorithm on a variety of platforms that show the validity of the chosen CGM-model, and also show the possible gains and limits of such an algorithm. Finally, we suggest to extend CGM model by the communication blow up to allow better a priori predictions of communication costs of algorithms

INRIA a CCSD electronic archive server

List Ranking on PC Clusters

Author: Gustedt Jens
Guérin Lassous Isabelle
Publication venue: HAL CCSD
Publication date: 01/01/2000
Field of study

We present two algorithms for the List Ranking Problem in the Coarse Grained Multicomputer model (CGM for short): if

p

is the number of processors and

n

the size of the list, then we give a deterministic one that achieves

O(\log p \log^* p)

communication rounds and

O(n \log^* p)

for the required communication cost and total computation time; and a randomized one that requires

O(\log p)

communication rounds and

O(n)

for the required communication cost and total computation time. We report on experimental studies of these algorithms on a PC cluster interconnected by a Myrinet network. As far as we know, it is the first portable code on this problem that runs on a cluster. With these experimental studies, we study the validity of the chosen CGM-model, and also show the possible gains and limits of such algorithms for PC clusters

CiteSeerX

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Feasability, Portability, Predictability and Efficiency : Four Ambitious Goals for the Design and Implementation of Parallel Coarse Grained Graph Algorithms

Author: Gustedt Jens
Guérin Lassous Isabelle
Morvan Michel
Publication venue: HAL CCSD
Publication date: 01/01/2000
Field of study

We study the relationship between the design and analysis of graph algorithms in the coarsed grained parallel models and the behavior of the resulting code on todays parallel machines and clusters. We conclude that the coarse grained multicomputer model (CGM) is well suited to design competitive algorithms, and that it is thereby now possible to aim to develop portable, predictable and efficient parallel algorithms code for graph problems

INRIA a CCSD electronic archive server

Better Trade-offs for Parallel List Ranking

Author: Sibeyn J.
Publication venue: ACM Press
Publication date: 01/01/1997
Field of study

An earlier parallel list ranking algorithm performs well for problem sizes

N

that are extremely large in comparison to the number of PUs

P

. However, no existing algorithm gives good performance for reasonable loads. We present a novel family of algorithms, that achieve a better trade-off between the number of start-ups and the routing volume. We have implemented them on an Intel Paragon, and they turn out to considerably outperform all earlier algorithms: with

P = 2

the sequential algorithm is already beaten for N = \mbox{25,000}; for

P = 100

and

N = 10^7

, the speed-up is 21, and for

N = 10^8

it even reaches 30. A modification of one of our algorithms solves a theoretical question: we show that on one-dimensional processor arrays, list ranking can be solved with a number of steps equal to the diameter of the network

MPG.PuRe

Better Trade-offs for Parallel List Ranking

Author: For N
Jop Sibeyn
Publication venue
Publication date
Field of study

An earlier parallel list ranking algorithm performs well for problem sizes N that are extremely large in comparison to the number of PUs P . However, no existing algorithm gives good performance for reasonable loads. We present a novel family of algorithms, that achieve a better trade-off between the number of start-ups and the routing volume. We have implemented them on an Intel Paragon, and they turn out to considerably outperform all earlier algorithms: with P = 2 the sequential algorithm is already beaten for N = 25,000; for P = 100 and N = 10 7 , the speed-up is 21, and for N = 10 8 it even reaches 30. A modification of one of our algorithms solves a theoretical question: we show that on one-dimensional processor arrays, list ranking can be solved with a number of steps equal to the diameter of the network. 1 Introduction A linked list, hereafter just list, is a basic data structure: it consists of nodes which are linked together, such that every node has precisely one predec..

CiteSeerX