1 research outputs found
Optimizing the domain wall fermion Dirac operator using the R-Stream source-to-source compiler
The application of the Dirac operator on a spinor field, the Dslash
operation, is the most computation-intensive part of the lattice QCD
simulations. It is often the key kernel to optimize to achieve maximum
performance on various platforms. Here we report on a project to optimize the
domain wall fermion Dirac operator in Columbia Physics System (CPS) using the
R-Stream source-to-source compiler. Our initial target platform is the Intel PC
clusters. We discuss the optimization strategies involved before and after the
automatic code generation with R-Stream and present some preliminary benchmark
results.Comment: 7 pages, 4 figures. Proceedings of the 33rd International Symposium
on Lattice Field Theory, July 14 -18, 2015, Kobe, Japa