2,509 research outputs found

A Lower Bound Technique for Communication in BSP

Author: Bilardi Gianfranco
Scquizzato Michele
Silvestri Francesco
Publication date: 25/11/2017

Communication is a major factor determining the performance of algorithms on current computing systems; it is therefore valuable to provide tight lower bounds on the communication complexity of computations. This paper presents a lower bound technique for the communication complexity in the bulk-synchronous parallel (BSP) model of a given class of DAG computations. The derived bound is expressed in terms of the switching potential of a DAG, that is, the number of permutations that the DAG can realize when viewed as a switching network. The proposed technique yields tight lower bounds for the fast Fourier transform (FFT), and for any sorting and permutation network. A stronger bound is also derived for the periodic balanced sorting network, by applying this technique to suitable subnetworks. Finally, we demonstrate that the switching potential captures communication requirements even in computational models different from BSP, such as the I/O model and the LPRAM.

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Padova

Verification of an algorithm for log-time sorting by square comparison

Author: Mulder J.C.
Weijland W.P.
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/01/1990

CWI's Institutional Repository

Fast arithmetic computing with neural networks

Author: Bruck Jehoshua
Siu Kai-Yeung
Publication venue: IEEE Hong Kong Section
Publication date: 01/09/1990

The authors introduce a restricted model of a neuron which is more practical as a model of computation than the classical model of a neuron. The authors define a model of neural networks as a feedforward network of such neurons. Whereas any logic circuit of polynomial size (in n) that computes the product of two n-bit numbers requires unbounded delay, such computations can be done in a neural network with constant delay. The authors improve some known results by showing that the product of two n-bit numbers and sorting of n n-bit numbers can both be computed by a polynomial size neural network using only four unit delays, independent of n. Moreover, the weights of each threshold element in the neural networks require only O(log n)-bit (instead of n-bit) accuracy.

Caltech Authors

A taxonomy of parallel sorting

Author: Bitton Dina
DeWitt David J.
Hsiao David K.
Menon Jaishankar
Publication date: 01/04/1984

TR 84-601. In this paper, we propose a taxonomy of parallel sorting that includes a broad range of array and file sorting algorithms. We analyze the evolution of research on parallel sorting, from the earliest sorting networks to the shared memory algorithms and the VLSI sorters. In the context of sorting networks, we describe two fundamental parallel merging schemes: the odd-even and the bitonic merge. Sorting algorithms have been derived from these merging algorithms for parallel computers where processors communicate through interconnection networks such as the perfect shuffle, the mesh and a number of other sparse networks. After describing the network sorting algorithms, we show that, with a shared memory model of parallel computation, faster algorithms have been derived from parallel enumeration sorting schemes, where keys are first ranked and then rearranged according to their rank.

eCommons@Cornell

Calhoun, Institutional Archive of the Naval Postgraduate School

Verification of an algorithm for log-time sorting by square comparison

Author: Mulder J.C.
Weijland W.P.
Publication venue: CWI
Publication date: 01/01/1987

CWI's Institutional Repository