Search CORE

41 research outputs found

Mechanisms for efficient, protected messaging

Author: Lee Whay Sing, 1967-
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/1999
Field of study

Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1999.Includes bibliographical references (p. 143-149).by Whay Sing Lee.Ph.D

CiteSeerX

DSpace@MIT

Doctor of Philosophy

Author: Parker Michael Allen
Publication venue: University of Utah
Publication date: 01/08/2013
Field of study

dissertationHigh-performance supercomputers on the Top500 list are commonly designed around commodity CPUs. Most of the codes executed on these machines are message-passing codes using the message-passing toolkit (MPI). Thus it makes sense to look at these machines from a holistic systems architecture perspective and consider optimizations to commodity processors that make them more efficient in message-passing architectures. Described herein is a new User-Level Notification (ULN) architecture that significantly improves message-passing performance. The architecture integrates a simultaneous multithreaded (SMT) processor with a user-level network interface (NI) that can directly control the execution scheduling of threads on the processor. By allowing the network interface to control the execution of message handling code at the user level, the operating system (OS) related overhead for handling interrupts and user code dispatch related to notifications is eliminated. By using an SMT processor, message handling can be performed in one thread concurrent to user computation in other threads, thus most of the overhead of executing message handlers can be hidden. This dissertation presents measurements showing the OS overheads related to message-passing are significant in modern architectures and describes a new architecture that significantly reduces these overheads. On a communication-intensive real-world application, the ULN architecture provides a 50.9% performance improvement over a more traditional OS-based NIC and a 5.29-31.9% improvement over a best-of-class user-level NIC due to the user-level notifications

The University of Utah: J. Willard Marriott Digital Library

An evaluation of Fugu's network deadlock avoidance solution

Author: Lee Victor Wui-Keung
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/1996
Field of study

Thesis (M.S.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1996.Includes bibliographical references (leaves 82-86).by Victor Lee.M.S

DSpace@MIT

Design and implementation of a multi-purpose cluster system NIU

Author: Ang Boon Seong, 1966-
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/1999
Field of study

Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1999.Includes bibliographical references (p. 209-221).by Boon Seong Ang.Ph.D

DSpace@MIT

Generalized Portable SHMEM library for high performance computing

Author: Parzyszek Krzysztof
Publication venue: Iowa State University Digital Repository
Publication date: 01/01/2003
Field of study

The Generalized Portable SHMEM library (GPSHMEM) is a portable implementation of the SHMEM library originally released by Cray Research Inc. on the Cray T3D. SHMEM and GPSHMEM realize the distributed shared memory programming model, that is, a shared memory programming model in environments in which memory is physically distributed. It is intended for use on a large variety of hardware platforms, including distributed systems with a network interconnect. The programming interface of GPSHMEM follows that of SHMEM and includes remote memory access operations (one-sided communication) and a set of collective routines such as broadcast, collection and reduction. Programming interfaces for C and Fortran are provided. Because of the minimal assumptions about the underlying hardware, GPSHMEM does not implement the full SHMEM T3D interface. The lack of a few functions is compensated by a set of extensions, including dynamic memory allocation for Fortran 77. To ease porting of SHMEM-enabled scientific Fortran 77 code from the Cray machines to use with GPSHMEM, a specialized Fortran 77 preprocessor was designed and developed

Digital Repository @ Iowa State University (ISU)

UNT Digital Library

Generalized Portable SHMEM Library for High Performance Computing

Author
Publication venue: 'Office of Scientific and Technical Information (OSTI)'
Publication date
Field of study

Crossref

18th IEEE Workshop on Nonlinear Dynamics of Electronic Systems: Proceedings

Author: Kelber Kristina
Schwarz Wolfgang
Tetzlaff Ronald
Publication venue: Technische Universität Dresden
Publication date: 03/08/2010
Field of study

Proceedings of the 18th IEEE Workshop on Nonlinear Dynamics of Electronic Systems, which took place in Dresden, Germany, 26 – 28 May 2010.:Welcome Address ........................ Page I Table of Contents ........................ Page III Symposium Committees .............. Page IV Special Thanks ............................. Page V Conference program (incl. page numbers of papers) ................... Page VI Conference papers Invited talks ................................ Page 1 Regular Papers ........................... Page 14 Wednesday, May 26th, 2010 ......... Page 15 Thursday, May 27th, 2010 .......... Page 110 Friday, May 28th, 2010 ............... Page 210 Author index ............................... Page XII

Technische Universität Dresden: Qucosa

Combinatorial Design and Analysis of Optimal Multiple Bus Systems for Parallel Algorithms.

Author: Kulasinghe Priyalal D
Publication venue: LSU Digital Commons
Publication date: 01/01/1995
Field of study

This dissertation develops a formal and systematic methodology for designing optimal, synchronous multiple bus systems (MBSs) realizing given (classes of) parallel algorithms. Our approach utilizes graph and group theoretic concepts to develop the necessary model and procedural tools. By partitioning the vertex set of the graphical representation CFG of the algorithm, we extract a set of interconnection functions that represents the interprocessor communication requirement of the algorithm. We prove that the optimal partitioning problem is NP-Hard. However, we show how to obtain polynomial time solutions by exploiting certain regularities present in many well-behaved parallel algorithms. The extracted set of interconnection functions is represented by an edge colored, directed graph called interconnection function graph (IFG). We show that the problem of constructing an optimal MBS to realize an IFG is NP-Hard. We show important special cases where polynomial time solutions exist. In particular, we prove that polynomial time solutions exist when the IFG is vertex symmetric. This is the case of interest for the vast majority of important interconnection function sets, whether extracted from algorithms or correspond to existing interconnection networks. We show that an IFG is vertex symmetric if and only if it is the Cayley color graph of a finite group

\Gamma

and its generating set

\Delta.

Using this property, we present a particular scheme to construct a symmetric

MBS\ M(\Gamma,\Delta)

with minimum number of buses as well as minimum number of interfaces realizing a vertex symmetric IFG. We demonstrate several advantages of the optimal

MBS\ M(\Gamma,\Delta)

in terms of its symmetry, number of ports per processor, number of neighbors per processor, and the diameter. We also investigate the fault tolerant capabilities and performance degradation of

M(\Gamma,\Delta)

in the case of a single bus failure, single driver failure, single receiver failure, and single processor failure. Further, we address the problem of designing an optimal MBS realizing a class of algorithms when the number of buses and/or processors in the target MBS are specified. The optimality criteria are maximizing the speed and minimizing the number of interfaces

Louisiana State University

Small-world interconnection networks for large parallel computer systems

Author: Rodríguez Salazar Fernando
Publication venue
Publication date: 01/01/2004
Field of study

The use of small-world graphs as interconnection networks of multicomputers is proposed and analysed in this work. Small-world interconnection networks are constructed by adding (or modifying) edges to an underlying local graph. Graphs with a rich local structure but with a large diameter are shown to be the most suitable candidates for the underlying graph. Generation models based on random and deterministic wiring processes are proposed and analysed. For the random case basic properties such as degree, diameter, average length and bisection width are analysed, and the results show that a fast transition from a large diameter to a small diameter is experienced when the number of new edges introduced is increased. Random traffic analysis on these networks is undertaken, and it is shown that although the average latency experiences a similar reduction, networks with a small number of shortcuts have a tendency to saturate as most of the traffic flows through a small number of links. An analysis of the congestion of the networks corroborates this result and provides away of estimating the minimum number of shortcuts required to avoid saturation. To overcome these problems deterministic wiring is proposed and analysed. A Linear Feedback Shift Register is used to introduce shortcuts in the LFSR graphs. A simple routing algorithm has been constructed for the LFSR and extended with a greedy local optimisation technique. It has been shown that a small search depth gives good results and is less costly to implement than a full shortest path algorithm. The Hilbert graph on the other hand provides some additional characteristics, such as support for incremental expansion, efficient layout in two dimensional space (using two layers), and a small fixed degree of four. Small-world hypergraphs have also been studied. In particular incomplete hypermeshes have been introduced and analysed and it has been shown that they outperform the complete traditional implementations under a constant pinout argument. Since it has been shown that complete hypermeshes outperform the mesh, the torus, low dimensional m-ary d-cubes (with and without bypass channels), and multi-stage interconnection networks (when realistic decision times are accounted for and with a constant pinout), it follows that incomplete hypermeshes outperform them as well

Glasgow Theses Service

OpenGrey Repository

Shrimping under working conditions

Author: Gallardo Francisco
Samson Audrey
Publication venue: Autonomedia (DATA browser 06)
Publication date: 01/01/2017
Field of study

We propose that mutated forms of death are emerging with neoliberalism’s biopolitical financialisation of life. Thinking of such forms as commercial extinction and social death, how do we begin to frame these outside of a quantified rhetoric of surplus? These questions aim to provoke a discussion about these terms that can be interpreted as modes of exhaustion, while maintaining particular biological, social or economic conditions of life. When we are confronted with capitalism’s failure to fulfil resource exhaustion, a model of conservation by dispossession1 might emerge within what Rosi Braidotti calls “new and subtler degrees of death and extinction” (2013, 115). In this text we want to think with other conditions of death and extinction that can help to move beyond the missing item of an inventory, a carved rock along a fossil road or a set of pre-emptive actions to be executed beyond a certain threshold. Thus, we ask if there could be figures, which rather than narrating death as a biological or geological concept, open it up to other equally violent forces that are nevertheless materially situated. More importantly, will we ever be able to think of extinction beyond ideas of absence or frame death from social or economic realms as an emerging mode of living? In order to address many of these questions we dissect a critical example of extinction, that of the brown shrimp (Crangon crangon) as it flips between commercial (albeit not yet biotic) death in the ex-fishing grounds of the South East corner of the UK, and the social death embedded in the labour-power of the ex-processing factories of the Special Economic Zones of Tangier and Tetuan in Morocco

Goldsmiths Research Online

Greenwich Academic Literature Archive