Search CORE

5 research outputs found

mallocMC 2.0.0crp: Policy Based Design

Author: Axel Huebl (5261149)
Carlchristian Eckert (5268979)
René Widera (5261146)
Publication venue
Publication date
Field of study

This release introduces mallocMC, which contains the previous algorithm and much code from ScatterAlloc 1.0.2crp. The project was renamed due to massive restructurization. The code uses ScatterAlloc as a reference algorithm, but can be extended to include other allocators in the future. We closed all issues documented in the Milestone Get Lib ready for PIConGPU.</p

FigShare

mallocMC 2.1.0crp: malloc Interface, Performance, Bugs

Author: Axel Huebl (5261149)
Carlchristian Eckert (5268979)
René Widera (5261146)
Publication venue
Publication date
Field of study

This release fixes some bugs that occured after the release of 2.0.1crp and reduces the interface to improve interoperability with the default CUDA allocator. The performance for large pages was improved. We closed all issues documented in Milestone New Features.</p

FigShare

mallocMC: 2.0.1crp: Bugfixes

Author: Axel Huebl (5261149)
Carlchristian Eckert (5268979)
René Widera (5261146)
Publication venue
Publication date
Field of study

This release fixes several bugs that occurred after the release of 2.0.0crp. We closed all issues documented in Milestone Bugfixes.</p

FigShare

Supplementary Materials for "On the Scalability of Data Reduction Techniques in Current and Upcoming HPC Systems from an Application Perspective"

Author: Alexander Matthes (5385304)
Axel Huebl (5261149)
Felix Schmitt (5261137)
Jong Youl Choi (5385298)
Michael Bussmann (5261152)
Norbert Podhorszki (5385295)
René Widera (5261146)
Scott Klasky (5385301)
Publication venue
Publication date
Field of study

Supplementary materials with all used benchmark scripts, plot scripts, benchmark results and PIConGPU example data for the submission to "The 1st International Workshop on Data Reduction for Big Scientific Data (DRBSD-1)" held in conjunction with ISC 2017 in Frankfurt, Germany.</p

FigShare

PIConGPU, Alpaka, and cupla software bundle for IWOPH 2016 submission

Author: Andreas Knüpfer (5287657)
Axel Huebl (5261149)
Benjamin Worpitz (5268976)
Erik Zenker (5268298)
Guido Juckeland (5287654)
Michael Bussmann (5261152)
René Widera (5261146)
Wolfgang E. Nagel (5287660)
Publication venue
Publication date
Field of study

This is the archive containing the software used for evaluations in the publication "Performance-Portable Many-Core Plasma Simulations: Porting PIConGPU to OpenPower and Beyond" submitted to the international workshop on OpenPOWER for HPC 2016. The archive has the following content: PIConGPU Kelvin-Helmholtz Simulation code (picongpu-alpaka/): <ul> <li> Remote: https://github.com/psychocoderHPC/picongpu-alpaka.git</li> <li> Branch: topic-scaling</li> <li> Commit: 1f004c8e0514ad1649f3958a6184878af6e75150</li> </ul> Alpaka code (alpaka/): <ul> <li>Remote: https://github.com/psychocoderHPC/alpaka.git</li> <li>Branch: topic-picongpu-alpaka</li> <li>Commit: 4a6dd35a9aff62e7f500623c3658685f827f73e5</li> </ul> Cupla (cupla/): <ul> <li>Remote: https://github.com/psychocoderHPC/cupla.git</li> <li>Branch: topic-dualAccelerators</li> <li>Commit: 4660f5fd8e888aa732230946046219f7e5daa1c9</li> </ul> The simulation was executed for one thousand time steps and the following configuration: <ul> <li> shape is higher then CIC, we used TSC</li> <li> pusher is Boris</li> <li> current solver is Esirkepov (optimized, generalized)</li> <li> Yee field solver</li> <li> trilinear interpolation in field gathering</li> <li> 16 particles per cell</li> </ul> Compile flags: <ul> <li>CPU g++-4.9.2: -g0 -O3 -m64 -funroll-loops -march=native -ffast-math --param max-unroll-times=512</li> <li>GPU nvcc: --use_fast_math --ftz=false -g0 -O3 -m64</li> </ul

FigShare