Parallelization of subdomain methods with overlapping for linear and nonlinear convection-diffusion problems
Linear and nonlinear convection-diffusion problems are considered, and their numerical solution via the Schwarz alternating method is studied. A new class of parallel asynchronous iterative methods with flexible communication is applied. The implementation of parallel asynchronous and synchronous algorithms on distributed-memory multiprocessors is described. Experimental results obtained on an IBM SP2 using PVM are presented and analyzed. The benefit of asynchronous iterative methods with flexible communication is clearly shown.
Programming models, compilers, and runtime systems for accelerator computing
Accelerators, such as GPUs and Intel Xeon Phis, have become the workhorses of high-performance computing. Typically, accelerators act as co-processors with discrete memory spaces. They offer massive parallelism, along with many other unique architectural features. To obtain high performance, these features must be carefully exploited, which demands considerable programmer expertise. This thesis presents new programming models, together with the compilers and runtime systems they require, to ease accelerator programming while still obtaining high performance.
- …