Search CORE

524,533 research outputs found

Small computer system interface (SCSI) universal services for the turbonet parallel computer

Author: Melkonian Artak Ohan
Publication venue: Digital Commons @ NJIT
Publication date: 31/01/1996
Field of study

TurboNet is a parallel computer with shared-memory and message-passing hybrid architecture. It employs two boards, with four digital signal processors (DSPs) each, and a host FORCE SPARC CPU-2CE board with a SCSI bus. Software has been developed in this thesis to provide SCSI services to programs running on the DSPs. DSP programs can therefore fully control assigned SCSI devices at the SCSI command level. Transfer control modifiers ensure compatibility with most SCSI devices. The software provides service for three SCSI access levels. The su SCSI universal device driver is built into the host computer\u27s kernel and is a gateway to the SCSI bus from user contexts. The hscsid SCSI request server daemon is an interrupt driven link between the DSP programs and the su driver. The Hydra SCSI utilities can be included in programs to make SCSI programming easier

Digital Commons @ New Jersey Institute of Technology (NJIT)

Recommended from our members

POWER: Parallel Optimizations With Executable Rewriting

Author: Arora Nipun
Bell Jonathan Schaffer
Kaiser Gail E.
Kim Martha Allen
Singh Vishal
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2011
Field of study

The hardware industry's rapid development of multicore and many core hardware has outpaced the software industry's transition from sequential to parallel programs. Most applications are still sequential, and many cores on parallel machines remain unused. We propose a tool that uses data-dependence profiling and binary rewriting to parallelize executables without access to source code. Our technique uses Bernstein's conditions to identify independent sets of basic blocks that can be executed in parallel, introducing a level of granularity between fine-grained instruction level and coarse grained task level parallelism. We analyze dynamically generated control and data dependence graphs to find independent sets of basic blocks which can be parallelized. We then propose to parallelize these candidates using binary rewriting techniques. Our technique aims to demonstrate the parallelism that remains in serial application by exposing concrete opportunities for parallelism

Columbia University Academic Commons

The new controls infrastructure for the SPS

Author: Charrue P
Clayton M J
Publication venue
Publication date: 28/11/1995
Field of study

A completely new control infrastructure has been installed in the SPS machine and experimental areas, replacing the old control system based on NORD computers that dated back to the 1970s. The new system uses Unix workstations and X terminals to replace the old console computers, and PC and VME chassis running LynxOS to replace the low-level interface computers. This paper will present the old equipment access, then describe the transitional phase when the two systems were run in parallel followed by the final complete transition to the new system and the removal of the NORD computers. A great effort was made to recreate the old programming environment in the new system in order to preserve the enormous investment in application programs. The equipment access and the NORD console simulator are two examples of this effort. Finally the paper will present the results of the first few months of operation of the SPS and its experimental areas with this new control infrastructure

CERN Document Server

Towards Implicit Parallel Programming for Systems

Author: Ertel Sebastian
Publication venue
Publication date: 30/12/2019
Field of study

Multi-core processors require a program to be decomposable into independent parts that can execute in parallel in order to scale performance with the number of cores. But parallel programming is hard especially when the program requires state, which many system programs use for optimization, such as for example a cache to reduce disk I/O. Most prevalent parallel programming models do not support a notion of state and require the programmer to synchronize state access manually, i.e., outside the realms of an associated optimizing compiler. This prevents the compiler to introduce parallelism automatically and requires the programmer to optimize the program manually. In this dissertation, we propose a programming language/compiler co-design to provide a new programming model for implicit parallel programming with state and a compiler that can optimize the program for a parallel execution. We define the notion of a stateful function along with their composition and control structures. An example implementation of a highly scalable server shows that stateful functions smoothly integrate into existing programming language concepts, such as object-oriented programming and programming with structs. Our programming model is also highly practical and allows to gradually adapt existing code bases. As a case study, we implemented a new data processing core for the Hadoop Map/Reduce system to overcome existing performance bottlenecks. Our lambda-calculus-based compiler automatically extracts parallelism without changing the program's semantics. We added further domain-specific semantic-preserving transformations that reduce I/O calls for microservice programs. The runtime format of a program is a dataflow graph that can be executed in parallel, performs concurrent I/O and allows for non-blocking live updates

Technische Universität Dresden: Qucosa