2 research outputs found

    A Flexible and Portable Approach for Communication in Distributed Computing Systems

    No full text
    <p>Talk by Erik Zenker held at his diploma defence at the 17th december 2014<br>  </p

    PIConGPU, Alpaka, and cupla software bundle for IWOPH 2016 submission

    No full text
    <p>This is the archive containing the software used for evaluations in the publication "Performance-Portable Many-Core Plasma Simulations: Porting PIConGPU to OpenPower and Beyond" submitted to the international workshop on OpenPOWER for HPC 2016.</p> <p>The archive has the following content:</p> <p>PIConGPU Kelvin-Helmholtz Simulation code (picongpu-alpaka/):</p> <ul> <li> Remote: https://github.com/psychocoderHPC/picongpu-alpaka.git</li> <li> Branch: topic-scaling</li> <li> Commit: 1f004c8e0514ad1649f3958a6184878af6e75150</li> </ul> <p>Alpaka code (alpaka/):</p> <ul> <li>Remote: https://github.com/psychocoderHPC/alpaka.git</li> <li>Branch: topic-picongpu-alpaka</li> <li>Commit: 4a6dd35a9aff62e7f500623c3658685f827f73e5</li> </ul> <p>Cupla (cupla/):</p> <ul> <li>Remote: https://github.com/psychocoderHPC/cupla.git</li> <li>Branch: topic-dualAccelerators</li> <li>Commit: 4660f5fd8e888aa732230946046219f7e5daa1c9</li> </ul> <p>The simulation was executed for one thousand time steps and the following configuration:</p> <ul> <li>   shape is higher then CIC, we used TSC</li> <li>   pusher is Boris</li> <li>   current solver is Esirkepov (optimized, generalized)</li> <li>   Yee field solver</li> <li>   trilinear interpolation in field gathering</li> <li>   16 particles per cell</li> </ul> <p>Compile flags:</p> <ul> <li>CPU g++-4.9.2: -g0 -O3 -m64 -funroll-loops -march=native -ffast-math --param max-unroll-times=512</li> <li>GPU nvcc: --use_fast_math --ftz=false -g0 -O3 -m64</li> </ul
    corecore