Search CORE

1,309 research outputs found

Multifrontal QR Factorization for Multicore Architectures over Runtime Systems

Author: Agullo Emmanuel
Buttari Alfredo
Guermouche Abdou
Lopez Florent
Publication venue: HAL CCSD
Publication date: 01/01/2013
Field of study

International audienceTo face the advent of multicore processors and the ever increasing complexity of hardware architectures, programming models based on DAG parallelism regained popularity in the high performance, scientific computing community. Modern runtime systems offer a programming interface that complies with this paradigm and powerful engines for scheduling the tasks into which the application is decomposed. These tools have already proved their effectiveness on a number of dense linear algebra applications. This paper evaluates the usability of runtime systems for complex applications, namely, sparse matrix multifrontal factorizations which constitute extremely irregular workloads, with tasks of different granularities and characteristics and with a variable memory consumption. Experimental results on real-life matrices show that it is possible to achieve the same efficiency as with an ad hoc scheduler which relies on the knowledge of the algorithm. A detailed analysis shows the performance behavior of the resulting code and possible ways of improving the effectiveness of runtime systems

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Open Archive Toulouse Archive Ouverte

HAL-Rennes 1

Implementing multifrontal sparse solvers for multicore architectures with Sequential Task Flow runtime systems

Author: Agullo Emmanuel
Buttari Alfredo
Guermouche Abdou
Lopez Florent
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/07/2016
Field of study

International audienceTo face the advent of multicore processors and the ever increasing complexity of hardware architectures, programming models based on DAG parallelism regained popularity in the high performance, scientific computing community. Modern runtime systems offer a programming interface that complies with this paradigm and powerful engines for scheduling the tasks into which the application is decomposed. These tools have already proved their effectiveness on a number of dense linear algebra applications. This paper evaluates the usability and effectiveness of runtime systems based on the Sequential Task Flow model for complex applications , namely, sparse matrix multifrontal factorizations which feature extremely irregular workloads, with tasks of different granularities and characteristics and with a variable memory consumption. Most importantly, it shows how this parallel programming model eases the development of complex features that benefit the performance of sparse, direct solvers as well as their memory consumption. We illustrate our discussion with the multifrontal QR factorization running on top of the StarPU runtime system. ACM Reference Format: Emmanuel Agullo, Alfredo Buttari, Abdou Guermouche and Florent Lopez, 2014. Implementing multifrontal sparse solvers for multicore architectures with Sequential Task Flow runtime system

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

INRIA a CCSD electronic archive server

HAL-Rennes 1

Exploiting a Parametrized Task Graph model for the parallelization of a sparse direct multifrontal solver

Author: Agullo Emmanuel
Bosilca George
Buttari Alfredo
Guermouche Abdou
Lopez Florent
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/08/2016
Field of study

International audienceThe advent of multicore processors requires to reconsider the design of high performance computing libraries to embrace portable and effective techniques of parallel software engineering. One of the most promising approaches consists in abstracting an application as a directed acyclic graph (DAG) of tasks. While this approach has been popularized for shared memory environments by the OpenMP 4.0 standard where dependencies between tasks are automatically inferred, we investigate an alternative approach, capable of describing the DAG of task in a distributed setting, where task dependencies are explicitly encoded. So far this approach has been mostly used in the case of algorithms with a regular data access pattern and we show in this study that it can be efficiently applied to a higly irregular numerical algorithm such as a sparse multifrontal QR method. We present the resulting implementation and discuss the potential and limits of this approach in terms of productivity and effectiveness in comparison with more common parallelization techniques. Although at an early stage of development, preliminary results show the potential of the parallel programming model that we investigate in this work

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

INRIA a CCSD electronic archive server

HAL-Rennes 1

The Blue Emission at 2.8 EV in Strontium Titanate: Evidence for a Radiative Transition of Self-Trapped Excitons from Unbound States

Author: Agullo-Lopez Fernando
Crespillo Miguel L.
Graham Joseph T.
Weber William J.
Zhang Y.
Publication venue: Scholars\u27 Mine
Publication date: 01/07/2019
Field of study

The origin of the blue emission in SrTiO3 has been investigated as a function of irradiation fluence, electronic excitation density, and temperature using a range of ion energies and masses. The emission clearly does not show correlation with the concentration of vacancies generated by irradiation but is greatly enhanced under heavy-ion irradiation. The intensity ratio of the 2.8 and 2.5 eV bands is independent of fluence at all temperatures, but it increases with excitation rate. The 2.8 eV emission is proposed to correspond to a transition from conduction band states to the ground state level of the self-trapped exciton center

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Recent Advances on Carrier and Exciton Self-Trapping in Strontium Titanate: Understanding the Luminescence Emissions

Author: Agullo-Lopez Fernando
Crespillo Miguel L.
Graham Joseph T.
Weber William J.
Zhang Yanwen
Publication venue: Scholars\u27 Mine
Publication date: 01/02/2019
Field of study

An up-to-date review on recent results for self-trapping of free electrons and holes, as well as excitons, in strontium titanate (STO), which gives rise to small polarons and self-trapped excitons (STEs) is presented. Special attention is paid to the role of carrier and exciton self-trapping on the luminescence emissions under a variety of excitation sources with special emphasis on experiments with laser pulses and energetic ion-beams. In spite of the extensive research effort, a definitive identification of such localized states, as well as a suitable understanding of their operative light emission mechanisms, has remained lacking or controversial. However, promising advances have been recently achieved and are the objective of the present review. In particular, significant theoretical advances in the understanding of electron and hole self-trapping are discussed. Also, relevant experimental advances in the kinetics of light emission associated with electron-hole recombination have been obtained through time-resolved experiments using picosecond (ps) laser pulses. The luminescence emission mechanisms and the light decay processes from the self-trapped excitons are also reviewed. Recent results suggest that the blue emission at 2.8 eV, often associated with oxygen vacancies, is related to a transition from unbound conduction levels to the ground singlet state of the STE. The stabilization of small electron polarons by oxygen vacancies and its connection with luminescence emission are discussed in detail. Through ion-beam irradiation experiments, it has recently been established that the electrons associated with the vacancy constitute electron polaron states (Ti3+) trapped in the close vicinity of the empty oxygen sites. These experimental results have allowed for the optical identification of the oxygen vacancy center through a red luminescence emission centered at 2.0 eV. Ab-initio calculations have provided strong support for those experimental findings. Finally, the use of Cr-doped STO has offered a way to monitor the interplay between the chromium centers and oxygen vacancies as trapping sites for the electron and hole partners resulting from the electronic excitation

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Fast and Accurate Simulation of Multithreaded Sparse Linear Algebra Solvers

Author: Agullo Emmanuel
Buttari Alfredo
Guermouche Abdou
Legrand Arnaud
Lopez Florent
Stanisic Luka
Videau Brice
Publication venue: HAL CCSD
Publication date: 14/12/2015
Field of study

International audienceThe ever growing complexity and scale of parallel architectures imposes to rewrite classical monolithic HPC scientific applications and libraries as their portability and performance optimization only comes at a prohibitive cost. There is thus a recent and general trend in using instead a modular approach where numerical algorithms are written at a high level independently of the hardware architecture as Directed Acyclic Graphs (DAG) of tasks. A task-based runtime system then dynamically schedules the resulting DAG on the different computing resources, automatically taking care of data movement and taking into account the possible speed heterogeneity and variability. Evaluating the performance of such complex and dynamic systems is extremely challenging especially for irregular codes. In this article, we explain how we crafted a faithful simulation, both in terms of performance and memory usage, of the behavior of qr_mumps, a fully-featured sparse linear algebra library, on multi-core architectures. In our approach, the target high-end machines are calibrated only once to derive sound performance models. These models can then be used at will to quickly predict and study in a reproducible way the performance of such irregular and resource-demanding applications using solely a commodity laptop

Scientific Publications of the University of Toulouse II Le Mirail

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

HAL-Rennes 1

Ion Beam irradiation of copper nitride: electronic vs elastic-collision mechanism

Author: Agullo Lopez Fernando
Gonzalez Arrabal Raquel
Gordillo Garcia Nuria
Munnik F.
Rivera de Mena Antonio
Publication venue: E.T.S.I. Industriales (UPM)
Publication date: 01/01/2011
Field of study

Copper nitride is a metastable material which results very attractive because of their potential to be used in functional device. Cu3 N easily decomposes into Cu and N2 by annealing [1] or irradiation (electron, ions, laser) [2, 3]. Previous studies carried out in N-rich Cu3 N films irradiated with Cu at 42MeV evidence a very efficient sputtering of N whose yield (5×10 3 atom/ion), for a film with a thickness of just 100 nm, suggest that the origin of the sputtering has an electronic nature. This N depletion was observed to be responsible for new phase formation ( Cu2 O) and pure Cu [4

Archivo Digital UPM

Ionoluminescence on α-quartz: mechanisms and modeling

Author: Agullo Lopez Fernando
Jimenez Rey D.
Olivares J.
Peña Rodríguez Ovidio Y.
Rivera de Mena Antonio
Publication venue: E.T.S.I. Industriales (UPM)
Publication date: 01/01/2012
Field of study

Ionoluminescence of α - quartz exhibits two dominant emission bands peaking at 1.9 eV. (NBOHCs) and 2.7 eV (STEs. The evolution of the red emission yield does not show a correlation with the concentrations of neither the NBOHC nor with that of other color centers. The blue emission yield closely follows the amorphization kinetics independently measured by RBS/C spectrometry. A simple theoretical model has been proposed; it assumes that the formation and recombination of STEs are the primary event and both, the light emissions and the lattice structural damage are a consequence this phenomenon. The model leads to several simple mathematical equations that can be used to simulate the IL yields and provide a reasonable fit to experimental kinetic data

Archivo Digital UPM

Refractive index changes in amorphous SiO2 (silica) by swift ion irradiation

Author: Agullo Lopez Fernando
Manzano Santamaría Javier
Olivares J.
Peña Rodríguez Ovidio Y.
Rivera de Mena Antonio
Publication venue: 'Elsevier BV'
Publication date: 01/01/2012
Field of study

The refractive index changes induced by swift ion-beam irradiation in silica have been measured either by spectroscopic ellipsometry or through the effective indices of the optical modes propagating through the irradiated structure. The optical response has been analyzed by considering an effective homogeneous medium to simulate the nanostructured irradiated system consisting of cylindrical tracks, associated to the ion impacts, embedded into a virgin material. The role of both, irradiation fluence and stopping power, has been investigated. Above a certain electronic stopping power threshold (∼2.5 keV/nm), every ion impact creates an axial region around the trajectory with a fixed refractive index (around n = 1.475) corresponding to a certain structural phase that is independent of stopping power. The results have been compared with previous data measured by means of infrared spectroscopy and small-angle X-ray scattering; possible mechanisms and theoretical models are discussed

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM

Amorphization kinectics under swift heavy ion irradiation: a cumulative overlapping-track approach

Author: Agullo Lopez Fernando
Crespillo Almenara Miguel
García G.
Gordillo N.
Olivares Roza Jimena
Rivera de Mena Antonio
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011
Field of study

A simple illustrative physical model is presented to describe the kinetics of damage and amorphization by swiftheavyions (SHI) in LiNbO3. The model considers that every ion impact generates initially a defective region (halo) and a full amorphous core whose relative size depends on the electronic stopping power. Below a given stopping power threshold only a halo is generated. For increasing fluences the amorphized area grows monotonically via overlapping of a fixed number N of halos. In spite of its simplicity the model, which provides analytical solutions, describes many relevant features of the kinetic behaviour. In particular, it predicts approximate Avrami curves with parameters depending on stopping power in qualitative accordance with experiment that turn into Poisson laws well above the threshold valu

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Digital.CSIC

Archivo Digital UPM