17,064 research outputs found

    A Multilevel Approach to Topology-Aware Collective Operations in Computational Grids

    Full text link
    The efficient implementation of collective communiction operations has received much attention. Initial efforts produced "optimal" trees based on network communication models that assumed equal point-to-point latencies between any two processes. This assumption is violated in most practical settings, however, particularly in heterogeneous systems such as clusters of SMPs and wide-area "computational Grids," with the result that collective operations perform suboptimally. In response, more recent work has focused on creating topology-aware trees for collective operations that minimize communication across slower channels (e.g., a wide-area network). While these efforts have significant communication benefits, they all limit their view of the network to only two layers. We present a strategy based upon a multilayer view of the network. By creating multilevel topology-aware trees we take advantage of communication cost differences at every level in the network. We used this strategy to implement topology-aware versions of several MPI collective operations in MPICH-G2, the Globus Toolkit[tm]-enabled version of the popular MPICH implementation of the MPI standard. Using information about topology provided by MPICH-G2, we construct these multilevel topology-aware trees automatically during execution. We present results demonstrating the advantages of our multilevel approach by comparing it to the default (topology-unaware) implementation provided by MPICH and a topology-aware two-layer implementation.Comment: 16 pages, 8 figure

    LEGaTO: first steps towards energy-efficient toolset for heterogeneous computing

    Get PDF
    LEGaTO is a three-year EU H2020 project which started in December 2017. The LEGaTO project will leverage task-based programming models to provide a software ecosystem for Made-in-Europe heterogeneous hardware composed of CPUs, GPUs, FPGAs and dataflow engines. The aim is to attain one order of magnitude energy savings from the edge to the converged cloud/HPC.Peer ReviewedPostprint (author's final draft

    ARTICLE NO. PC971367 A Library-Based Approach to Task Parallelism in a Data-Parallel Language

    Get PDF
    Pure data-parallel languages such as High Performance Fortran version 1 (HPF) do not allow efficient expression of mixed task/data-parallel computations or the coupling of separately compiled data-parallel modules. In this paper, we show how these common parallel program structures can be represented, with only minor extensions to the HPF model, by using a coordination library based on the Message Passing Interface (MPI). This library allows data-parallel tasks to exchange distributed data structures using calls to simple communication functions. We present microbenchmark results that characterize the performance of this library and that quantify the impact of optimizations that allow reuse of communication schedules in common situations. In addition, results from two-dimensional FFT, convolution, and multiblock programs demonstrate that the HPF/ MPI library can provide performance superior to that of pure HPF. We conclude that this synergistic combination of two parallel programming standards represents a useful approach to task parallelism in a data-parallel framework, increasing the range of problems addressable in HPF without requiring complex compile

    dOpenCL: Towards a Uniform Programming Approach for Distributed Heterogeneous Multi-/Many-Core Systems

    Get PDF
    Modern computer systems are becoming increasingly heterogeneous by comprising multi-core CPUs, GPUs, and other accelerators. Current programming approaches for such systems usually require the application developer to use a combination of several programming models (e. g., MPI with OpenCL or CUDA) in order to exploit the full compute capability of a system. In this paper, we present dOpenCL (Distributed OpenCL) – a uniform approach to programming distributed heterogeneous systems with accelerators. dOpenCL extends the OpenCL standard, such that arbitrary computing devices installed on any node of a distributed system can be used together within a single application. dOpenCL allows moving data and program code to these devices in a transparent, portable manner. Since dOpenCL is designed as a fully-fledged implementation of the OpenCL API, it allows running existing OpenCL applications in a heterogeneous distributed environment without any modifications. We describe in detail the mechanisms that are required to implement OpenCL for distributed systems, including a device management mechanism for running multiple applications concurrently. Using three application studies, we compare the performance of dOpenCL with MPI+OpenCL and a standard OpenCL implementation

    MADNESS: A Multiresolution, Adaptive Numerical Environment for Scientific Simulation

    Full text link
    MADNESS (multiresolution adaptive numerical environment for scientific simulation) is a high-level software environment for solving integral and differential equations in many dimensions that uses adaptive and fast harmonic analysis methods with guaranteed precision based on multiresolution analysis and separated representations. Underpinning the numerical capabilities is a powerful petascale parallel programming environment that aims to increase both programmer productivity and code scalability. This paper describes the features and capabilities of MADNESS and briefly discusses some current applications in chemistry and several areas of physics

    The new Nowhere Land? : a research and practice agenda for the “Always on Culture"

    Get PDF
    Purpose: Rapid developments in the field of information communication technology (ICT) mean that e-working has become increasingly common and prolonged – the “always-onculture” potential to enhance work-life balance via increased flexibility in terms of time and location, as well as posing the risk of being ‘always on’ has been identified with potentially serious implications for the health and performance of employees. We identify a research agenda and review current organizational practice. Approach: We discuss current technological developments as well as prevalent research frameworks and terminology in the domain of work-life balance and beyond to evaluate their fitness for purpose. We also report findings from a survey of 374 employees working within UK businesses about current organisational practice. Findings: Over half of the organisations sampled do not have clear guidance regarding worklife balance and supporting employees with regards to ICT enabled working. Key challenges are the sheer volume of email traffic, lack of training and infrastructure and an absence of appropriate support. Practical implications: Organisations need to develop clear policies regarding the psychosocial aspects of technology use and provide evidence-based guidance to managers and employees. Social implications: Managers and individuals require support to engage with technology in a healthy and sustainable way

    Requirements of time management tools for outpatient physiotherapy practice

    Get PDF
    The effects of electronic appointment booking systems on the time management activities of health professionals have received little attention to date. We report on time management practices in three outpatient physiotherapy departments with different paper and electronic systems. The study has identified a set of time management activities and associated social behaviours common to physiotherapy departments. The convenience, flexibility and expressive nature of paper diary systems is of significant value to users, whilst the clarity and superior database functionality of electronic systems are valued by staff using this medium. The study highlights several potential barriers to the effective deployment of electronic booking systems in physiotherapy departments, including poor resource and training provision, concerns regarding restrictive diary control measures, the continued reliance on burdensome duplication procedures and the need to coordinate multiple information artefacts, which need to be addressed if such technology is to be successfully designed and deployed. Copyright © 2005 SAGE Publications (London, Thousand Oaks, CA and New Delhi)
    • 

    corecore