3,278 research outputs found
Recommended from our members
Automation of Determination of Optimal Intra-Compute Node Parallelism
Maximizing the productivity of modern multicore and manycore chips requires optimizing parallelism at the compute node level. This is, however, a complex multi-step process. It is an iterative method requiring determining optimal degrees of parallel scalability and optimizing memory access behavior. Further, there are multiple cases to be considered, programs which use only MPI or OpenMP and hybrid (MPI +OpenMP) programs. This paper presents a set of three coordinated workflows for determining the optimal parallelism at the program level for MPI programs and at the loop level for hybrid (MPI+OpenMP) cases. The paper also details mostly automated implementations of these workflows using the PerfExpert infrastructure. Finally the paper presents case studies demonstrating both the applicability and the effectiveness of optimizing parallelism at the compute node level. The results shown in the paper will provide valuable information to further advance in the full automation of the workflows. The software implementing the parallelism scalability optimization is open source and available for download.Texas Advanced Computing Center (TACC)Computer Science
Building initial models of rotating white dwarfs with SPH
A general procedure to build self-gravitational, rotating equilibrium structures with the Smoothed Particle Hydrodynamics (SPH) technique does not exist. In particular, obtaining
stable rotating configurations for white dwarf (WD) stars is
currently a major drawback of many astrophysical simulations.
Rotating WDs with low internal temperatures are connected with
both, explosive and implosive scenarios such as type Ia supernova
explosions or neutron stars formation. Simulations of these events
with SPH codes demand stable enough particle configurations as
initial models. In this work we have developed and tested a relaxation method to obtain equilibrium configurations of rotating
WDs. This method is straightforward and takes advantage of the
excellent mass and angular momentum conservation properties
of the SPH technique. Although we focus on rigid rotation and
its potential applications to several Type Ia supernova scenarios,
we also show that our proposal is also able to provide good initial
models in differential rotation, which has the potential to benefit
many other types of simulations where rotation plays a capital
role, like disk evolution and stellar formation.Peer ReviewedPostprint (published version
La Constitución apostólica «Ut sit» de 28-XI-1982. Acerca de su «Pars» narrativa.
Material incluido en el volumen especial de la revista del Instituto Martín de Azpilcueta, Universidad de Navarra : Ius Canonicum (1999), en honor de Javier Hervada
El octavo principio directivo para la reforma del "Codex Iuris Canonici": El iter de su formulación
Recommended from our members
Benchmarking the Intel®Xeon®Platinum 8160 Processor
This report presents a set of results for different microbenchmarks and applications on the Intel
Xeon Platinum8160 Processor, formerly known as Skylake. For simplicity, we will use both Skylake
and SKX to refer to this processor. We use the Skylake nodes that will be available in Stampede2.
This systemwill provide Intel Knights Landing and Skylake chips interconnected by a 100 Gb/sec
Intel Omni-Path (OPA) network with a fat tree topology. The peak performance of the system will
be 18 PF.Texas Advanced Computing Center (TACC
- …