Search CORE

1,225 research outputs found

Parallel Low-Storage Runge-Kutta Solvers for ODE Systems with Limited Access Distance

Author: Korch Matthias
Rauber Thomas
Publication venue
Publication date: 01/01/2010
Field of study

Parallelization Strategies for Density Matrix Renormalization Group Algorithms on Shared-Memory Systems

Author: E. Jeckelmann
Fehske
G. Hager
G. Wellein
Goedecker
Gutzwiller
H. Fehske
Holstein
Hubbard
Jeckelmann
Kanamori
Nishimoto
Wellein
White
White
Publication venue: 'Elsevier BV'
Publication date: 20/05/2003
Field of study

Shared-memory parallelization (SMP) strategies for density matrix renormalization group (DMRG) algorithms enable the treatment of complex systems in solid state physics. We present two different approaches by which parallelization of the standard DMRG algorithm can be accomplished in an efficient way. The methods are illustrated with DMRG calculations of the two-dimensional Hubbard model and the one-dimensional Holstein-Hubbard model on contemporary SMP architectures. The parallelized code shows good scalability up to at least eight processors and allows us to solve problems which exceed the capability of sequential DMRG calculations.Comment: 18 pages, 9 figure

arXiv.org e-Print Archive

Coordinated Caching for High Performance Calibration using Z -> µµ Events of the CMS Experiment

Author: Fischer Max
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2016
Field of study

Calibration of the detectors is a prerequisite for almost all physics analyses conducted as part of the LHC experiment. As such, both speed and precision are critical. As part of this thesis, a high performance analysis infrastructure using coordinated caching has been developed. This has been used to conduct the first calibration of jets using Z -> µµ events recorded during the second LHC run at the CMS experiment

Parallel Low-Storage Runge—Kutta Solvers for ODE Systems with Limited Access Distance

Author: Allen R.
Baskaran M.M.
Berland J.
Bilmes J.
Brankin R.W.
Burrage K.
Butenhof D.R.
Fehlberg E.
Hairer E.
Hull T.E.
Kennedy C.A.
Korch M.
Korch M.
Matthias Korch
Rauber T.
Schiesser W.E.
Snir M.
Strehmel K.
Thomas Rauber
Tiwari A.
White J.
Wolfe M.
Publication venue: 'SAGE Publications'
Publication date
Field of study

Recommended from our members

Automation of Determination of Optimal Intra-Compute Node Parallelism

Author: Brown James C.
Gómez-Iglesias Antonio
Publication venue
Publication date: 01/01/2016
Field of study

Maximizing the productivity of modern multicore and manycore chips requires optimizing parallelism at the compute node level. This is, however, a complex multi-step process. It is an iterative method requiring determining optimal degrees of parallel scalability and optimizing memory access behavior. Further, there are multiple cases to be considered, programs which use only MPI or OpenMP and hybrid (MPI +OpenMP) programs. This paper presents a set of three coordinated workﬂows for determining the optimal parallelism at the program level for MPI programs and at the loop level for hybrid (MPI+OpenMP) cases. The paper also details mostly automated implementations of these workﬂows using the PerfExpert infrastructure. Finally the paper presents case studies demonstrating both the applicability and the effectiveness of optimizing parallelism at the compute node level. The results shown in the paper will provide valuable information to further advance in the full automation of the workﬂows. The software implementing the parallelism scalability optimization is open source and available for download.Texas Advanced Computing Center (TACC)Computer Science

Texas ScholarWorks

Enhancing Energy Production with Exascale HPC Methods

Author: Camata José J.
Cela José M.
Costa Danilo
Coutinho Alvaro LGA
Fernández-Galisteo Daniel
Jiménez Carmen
Kourdioumov Vadim
Mattoso Marta
Mayo-García Rafael
Miras Thomas
Moríñigo José A.
Navarro Jorge
Navaux Philippe O.A.
Oliveira Daniel de
Rodríguez-Pascual Manuel
Silva Vítor
Souza Renan
Valduriez Patrick
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

High Performance Computing (HPC) resources have become the key actor for achieving more ambitious challenges in many disciplines. In this step beyond, an explosion on the available parallelism and the use of special purpose processors are crucial. With such a goal, the HPC4E project applies new exascale HPC techniques to energy industry simulations, customizing them if necessary, and going beyond the state-of-the-art in the required HPC exascale simulations for different energy sources. In this paper, a general overview of these methods is presented as well as some specific preliminary results.The research leading to these results has received funding from the European Union's Horizon 2020 Programme (2014-2020) under the HPC4E Project (www.hpc4e.eu), grant agreement n° 689772, the Spanish Ministry of Economy and Competitiveness under the CODEC2 project (TIN2015-63562-R), and from the Brazilian Ministry of Science, Technology and Innovation through Rede Nacional de Pesquisa (RNP). Computer time on Endeavour cluster is provided by the Intel Corporation, which enabled us to obtain the presented experimental results in uncertainty quantification in seismic imagingPostprint (author's final draft

INRIA a CCSD electronic archive server

HAL-Rennes 1