Software for Parallel Computing: the LAM Implementation of MPI

Author: NC DOCKS at The University of North Carolina at Greensboro
Swann Christopher A.
Publication date: 01/01/2001

Many econometric problems can benefit from the application of parallel computing techniques, and recent advances in hardware and software have made such application feasible. There are a number of freely available software libraries that make it possible to write message-passing parallel programs using personal computers or Unix workstations. This review discusses one of these: the LAM (Local Area Multiprocessor) implementation of MPI (the Message Passing Interface).

The University of North Carolina at Greensboro

Efficient shared memory message passing for inter-VM communications

Author: Diakhaté François
Jourdren Hervé
Namyst Raymond
Pérache Marc
Publication venue: HAL CCSD
Publication date: 01/08/2008

Thanks to recent advances in virtualization technologies, it is now possible to benefit from the flexibility brought by virtual machines at little cost in terms of CPU performance. However, on HPC clusters some overheads remain which prevent widespread usage of virtualization. In this article, we tackle the issue of inter-VM MPI communications when VMs are located on the same physical machine. To achieve this, we introduce a virtual device which provides a simple message passing API to the guest OS. This interface can then be used to implement an efficient MPI library for virtual machines. The use of a virtual device makes our solution easily portable across multiple guest operating systems, since it only requires a small driver to be written for this device. We present an implementation based on Linux, the KVM hypervisor and Qemu as its userspace device emulator. Our implementation achieves near-native performance in terms of MPI latency and bandwidth.

Crossref

INRIA a CCSD electronic archive server

HAL-CEA

POSH: Paris OpenSHMEM: A High-Performance OpenSHMEM Implementation for Shared Memory Systems

Author: Coti Camille
Publication date: 30/03/2014

In this paper we present the design and implementation of POSH, an open-source implementation of the OpenSHMEM standard. We present a model for its communications, and prove some properties of the memory model defined in the OpenSHMEM specification. We present some performance measurements of the communication library featured by POSH and compare them with an existing one-sided communication library. POSH can be downloaded from http://www.lipn.fr/~coti/POSH. Comment: This is an extended version (featuring the full proofs) of a paper accepted at ICCS'1

arXiv.org e-Print Archive

Elsevier - Publisher Connector

Total order broadcast for fault tolerant exascale systems

Author: Appavoo Jonathan
Cadden James
Krieger Orran
Schatzberg Dan
Publication venue: Computer Science Department, Boston University
Publication date: 10/07/2013

In the process of designing a new fault tolerant run-time for future exascale systems, we discovered that a total order broadcast would be necessary. That is, nodes of a supercomputer should be able to broadcast messages to other nodes even in the face of failures. All messages should be seen in the same order at all nodes. While this is a well-studied problem in distributed systems, few researchers have looked at how to perform total order broadcasts at large scales for data availability. Our experience implementing a published total order broadcast algorithm showed poor scalability at tens of nodes. In this paper we present a novel algorithm for total order broadcast which scales logarithmically in the number of processes and is not delayed by most process failures. While we are motivated by the needs of our run-time, we believe this primitive is of general applicability. Total order broadcasts are used often in datacenter environments, and as HPC developers begin to address fault tolerance at the application level, we believe they will need similar primitives.
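
The ordering property described in this abstract (every node sees all messages in the same order) can be illustrated with a minimal sketch. This is not the algorithm from the paper: it assumes a single fixed sequencer that stamps each message with a global sequence number, which is neither fault tolerant nor logarithmic in the number of processes. The names `Node`, `Sequencer` and `demo` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """A receiver that delivers messages strictly in sequence-number order."""
    name: str
    next_seq: int = 0                              # next sequence number to deliver
    pending: dict = field(default_factory=dict)    # early arrivals, keyed by seq
    delivered: list = field(default_factory=list)  # messages in delivery order

    def receive(self, seq, msg):
        # Buffer the message, then deliver every message that is now in order.
        self.pending[seq] = msg
        while self.next_seq in self.pending:
            self.delivered.append(self.pending.pop(self.next_seq))
            self.next_seq += 1


class Sequencer:
    """Assumed single sequencer: assigns one global order to all broadcasts."""

    def __init__(self):
        self.counter = 0

    def stamp(self, msg):
        seq = self.counter
        self.counter += 1
        return seq, msg


def demo():
    nodes = [Node("a"), Node("b"), Node("c")]
    sequencer = Sequencer()
    stamped = [sequencer.stamp(m) for m in ["m1", "m2", "m3"]]
    # Simulate a network that reorders messages differently for each node.
    arrival_orders = [[0, 1, 2], [2, 0, 1], [1, 2, 0]]
    for node, order in zip(nodes, arrival_orders):
        for i in order:
            node.receive(*stamped[i])
    return [node.delivered for node in nodes]


if __name__ == "__main__":
    for delivered in demo():
        print(delivered)
```

Each node buffers messages that arrive early and delivers them only once all lower-numbered messages have been delivered, so the three delivery lists are identical even though arrival orders differ. The sequencer is a single point of failure and a bottleneck, which is the kind of limitation a fault tolerant, scalable algorithm has to address.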

Boston University Institutional Repository (OpenBU)