577 research outputs found

    The Fortran parallel transformer and its programming environment

    Get PDF

    Parallelization of subdomain methods with overlapping for linear and nonlinear convection-diffusion problems

    Get PDF
    International audienceLinear and nonlinear convection-diffusion problems are considered. The numerical solution of these problems via the Schwarz alternating method is studied. A new class of parallel asynchronous iterative methods with flexible communication is applied. The implementation of parallel asyn-chronous and synchronous algorithms on distributed memory multiprocessors is described. Experimental results obtained on an IBM SP2 by using PVM are presented and analyzed. The interest of asynchronous iterative methods with flexible communication is clearly shown

    Programming models, compilers, and runtime systems for accelerator computing

    Get PDF
    Accelerators, such as GPUs and Intel Xeon Phis, have become the workhorses of high-performance computing. Typically, the accelerators act as co-processors, with discrete memory spaces. They possess massive parallelism, along with many other unique architectural features. In order to obtain high performance, these features must be carefully exploited, which requires high programmer expertise. This thesis presents new programming models, and the necessary compiler and runtime systems to ease the accelerator programming process, while obtaining high performance
    • …
    corecore