
A Lower Bound Technique for Communication in BSP

Authors: Bilardi Gianfranco, Scquizzato Michele, Silvestri Francesco
Publication date: 25/11/2017

Communication is a major factor determining the performance of algorithms on current computing systems; it is therefore valuable to provide tight lower bounds on the communication complexity of computations. This paper presents a lower bound technique for the communication complexity in the bulk-synchronous parallel (BSP) model of a given class of DAG computations. The derived bound is expressed in terms of the switching potential of a DAG, that is, the number of permutations that the DAG can realize when viewed as a switching network. The proposed technique yields tight lower bounds for the fast Fourier transform (FFT), and for any sorting and permutation network. A stronger bound is also derived for the periodic balanced sorting network, by applying this technique to suitable subnetworks. Finally, we demonstrate that the switching potential captures communication requirements even in computational models different from BSP, such as the I/O model and the LPRAM.
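To make the notion of switching potential concrete, here is a minimal brute-force sketch in Python. It is not taken from the paper: it assumes a network built only from 2x2 switches, each with two settings (straight or crossed), and the function name `count_realized_permutations` and the stage-list encoding are illustrative choices. It enumerates every combination of switch settings and counts the distinct permutations of the inputs that appear at the outputs.

```python
from itertools import product


def count_realized_permutations(num_wires, stages):
    """Count distinct permutations a network of 2x2 switches can realize.

    `stages` is a list of stages; each stage is a list of (a, b) wire pairs,
    one pair per switch. This encoding is an assumption made for illustration.
    """
    # Switches within a stage touch disjoint wires, so applying them
    # one after another in stage order is equivalent to applying them together.
    switches = [pair for stage in stages for pair in stage]
    realized = set()
    # Try every assignment of straight (False) / crossed (True) to the switches.
    for setting in product((False, True), repeat=len(switches)):
        wires = list(range(num_wires))
        for (a, b), crossed in zip(switches, setting):
            if crossed:
                wires[a], wires[b] = wires[b], wires[a]
        realized.add(tuple(wires))
    return len(realized)


# A 4-input FFT-style butterfly: two stages of two switches each.
butterfly_4 = [[(0, 2), (1, 3)], [(0, 1), (2, 3)]]
print(count_realized_permutations(4, butterfly_4))
```

For this 4-input butterfly all 16 switch settings produce different permutations, so the count is 16, which is fewer than the 4! = 24 permutations of four items. The enumeration is exponential in the number of switches and is only meant for toy networks.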

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Padova

Optimized Merge Sort on Modern Commodity Multi-core CPUs

Authors: Xu Ming, Xu Xianbin, Yin MengJia, Zheng Fang
Publication venue: 'Universitas Ahmad Dahlan'
Publication date: 01/03/2016

Sorting is one of the most widely used basic algorithms. As high-performance computing devices become increasingly common, more and more modern commodity machines are capable of parallel computation. A new implementation of sorting is proposed to harness the newer SIMD operations and multi-core computing provided by modern CPUs. The algorithm is a hybrid of an optimized bitonic sorting network and a multi-way merge. The bitonic network implementation uses new SIMD instructions provided by modern CPUs and adopts a different method of arranging data so that the number of SIMD operations is reduced. Balanced binary trees are used in the multi-way merge, which also differs from earlier implementations. Effort is also devoted to minimizing data movement in memory, since merge sort is a memory-bound application. The performance evaluation shows that the proposed algorithm is twice as fast as the sort function in the C++ standard library when only a single thread is used. It also outperforms the radix sort implemented in the Boost library.
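The hybrid structure described in this abstract (sort small runs with a bitonic network, then combine them with a multi-way merge) can be sketched in plain Python. This is only an illustration under stated assumptions, not the authors' implementation: it is scalar rather than SIMD, it uses the standard library's heap-based `heapq.merge` in place of the paper's balanced-binary-tree merge, and the names `bitonic_sort`, `hybrid_sort`, and the `chunk` parameter are hypothetical.

```python
import heapq


def bitonic_sort(values):
    """Sort a sequence whose length is a power of two with a bitonic network."""
    data = list(values)
    n = len(data)
    if n & (n - 1):
        raise ValueError("length must be a power of two")
    k = 2
    while k <= n:            # size of the bitonic sequences being merged
        j = k // 2
        while j >= 1:        # distance between compared elements
            for i in range(n):
                partner = i ^ j
                if partner > i:
                    # Blocks of size k alternate between ascending and descending.
                    ascending = (i & k) == 0
                    if (data[i] > data[partner]) == ascending:
                        data[i], data[partner] = data[partner], data[i]
            j //= 2
        k *= 2
    return data


def hybrid_sort(values, chunk=4):
    """Sort fixed-size runs with the bitonic network, then merge all runs."""
    if len(values) % chunk:
        raise ValueError("length must be a multiple of the chunk size")
    runs = [bitonic_sort(values[i:i + chunk])
            for i in range(0, len(values), chunk)]
    # heapq.merge is a stand-in for the tree-based multi-way merge.
    return list(heapq.merge(*runs))


print(hybrid_sort([5, 3, 8, 1, 9, 2, 7, 4, 6, 0, 11, 10]))
```

The comparison pattern inside `bitonic_sort` does not depend on the data, which is the property that makes such networks suitable for SIMD execution.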

Journal of Education and Learning (EduLearn)

TELKOMNIKA (Telecommunication Computing Electronics and Control)

UAD Journal Management System

The $p$-Center Problem in Tree Networks Revisited

Authors: Banik Aritra, Bhattacharya Binay, Das Sandip, Kameda Tsunehiko, Song Zhao
Publication date: 01/01/2016

We present two improved algorithms for the weighted discrete $p$-center problem for tree networks with $n$ vertices. One of our proposed algorithms runs in $O(n \log n + p \log^2 n \log(n/p))$ time. For all values of $p$, our algorithm thus runs as fast as or faster than the most efficient $O(n \log^2 n)$ time algorithm obtained by applying Cole's speed-up technique [cole1987] to the algorithm due to Megiddo and Tamir [megiddo1983], which has remained unchallenged for nearly 30 years. Our other algorithm, which is more practical, runs in $O(n \log n + p^2 \log^2(n/p))$ time, and when $p = O(\sqrt{n})$ it is faster than Megiddo and Tamir's $O(n \log^2 n \log\log n)$ time algorithm [megiddo1983].

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Faster 3-Periodic Merging Networks

Author: Piotrów Marek
Publication date: 02/01/2014

We consider the problem of merging two sorted sequences on a comparator network that is used repeatedly, that is, if the output is not sorted, the network is applied again using the output as input. The challenging task is to construct such networks of small depth. The first constructions of merging networks with a constant period were given by Kutyłowski, Loryś and Oesterdikhoff. They have given