1,309 research outputs found

    Multifrontal QR Factorization for Multicore Architectures over Runtime Systems

    Get PDF
    International audienceTo face the advent of multicore processors and the ever increasing complexity of hardware architectures, programming models based on DAG parallelism regained popularity in the high performance, scientific computing community. Modern runtime systems offer a programming interface that complies with this paradigm and powerful engines for scheduling the tasks into which the application is decomposed. These tools have already proved their effectiveness on a number of dense linear algebra applications. This paper evaluates the usability of runtime systems for complex applications, namely, sparse matrix multifrontal factorizations which constitute extremely irregular workloads, with tasks of different granularities and characteristics and with a variable memory consumption. Experimental results on real-life matrices show that it is possible to achieve the same efficiency as with an ad hoc scheduler which relies on the knowledge of the algorithm. A detailed analysis shows the performance behavior of the resulting code and possible ways of improving the effectiveness of runtime systems

    Implementing multifrontal sparse solvers for multicore architectures with Sequential Task Flow runtime systems

    Get PDF
    International audienceTo face the advent of multicore processors and the ever increasing complexity of hardware architectures, programming models based on DAG parallelism regained popularity in the high performance, scientific computing community. Modern runtime systems offer a programming interface that complies with this paradigm and powerful engines for scheduling the tasks into which the application is decomposed. These tools have already proved their effectiveness on a number of dense linear algebra applications. This paper evaluates the usability and effectiveness of runtime systems based on the Sequential Task Flow model for complex applications , namely, sparse matrix multifrontal factorizations which feature extremely irregular workloads, with tasks of different granularities and characteristics and with a variable memory consumption. Most importantly, it shows how this parallel programming model eases the development of complex features that benefit the performance of sparse, direct solvers as well as their memory consumption. We illustrate our discussion with the multifrontal QR factorization running on top of the StarPU runtime system. ACM Reference Format: Emmanuel Agullo, Alfredo Buttari, Abdou Guermouche and Florent Lopez, 2014. Implementing multifrontal sparse solvers for multicore architectures with Sequential Task Flow runtime system

    Exploiting a Parametrized Task Graph model for the parallelization of a sparse direct multifrontal solver

    Get PDF
    International audienceThe advent of multicore processors requires to reconsider the design of high performance computing libraries to embrace portable and effective techniques of parallel software engineering. One of the most promising approaches consists in abstracting an application as a directed acyclic graph (DAG) of tasks. While this approach has been popularized for shared memory environments by the OpenMP 4.0 standard where dependencies between tasks are automatically inferred, we investigate an alternative approach, capable of describing the DAG of task in a distributed setting, where task dependencies are explicitly encoded. So far this approach has been mostly used in the case of algorithms with a regular data access pattern and we show in this study that it can be efficiently applied to a higly irregular numerical algorithm such as a sparse multifrontal QR method. We present the resulting implementation and discuss the potential and limits of this approach in terms of productivity and effectiveness in comparison with more common parallelization techniques. Although at an early stage of development, preliminary results show the potential of the parallel programming model that we investigate in this work

    The Blue Emission at 2.8 EV in Strontium Titanate: Evidence for a Radiative Transition of Self-Trapped Excitons from Unbound States

    Get PDF
    The origin of the blue emission in SrTiO3 has been investigated as a function of irradiation fluence, electronic excitation density, and temperature using a range of ion energies and masses. The emission clearly does not show correlation with the concentration of vacancies generated by irradiation but is greatly enhanced under heavy-ion irradiation. The intensity ratio of the 2.8 and 2.5 eV bands is independent of fluence at all temperatures, but it increases with excitation rate. The 2.8 eV emission is proposed to correspond to a transition from conduction band states to the ground state level of the self-trapped exciton center

    Recent Advances on Carrier and Exciton Self-Trapping in Strontium Titanate: Understanding the Luminescence Emissions

    Get PDF
    An up-to-date review on recent results for self-trapping of free electrons and holes, as well as excitons, in strontium titanate (STO), which gives rise to small polarons and self-trapped excitons (STEs) is presented. Special attention is paid to the role of carrier and exciton self-trapping on the luminescence emissions under a variety of excitation sources with special emphasis on experiments with laser pulses and energetic ion-beams. In spite of the extensive research effort, a definitive identification of such localized states, as well as a suitable understanding of their operative light emission mechanisms, has remained lacking or controversial. However, promising advances have been recently achieved and are the objective of the present review. In particular, significant theoretical advances in the understanding of electron and hole self-trapping are discussed. Also, relevant experimental advances in the kinetics of light emission associated with electron-hole recombination have been obtained through time-resolved experiments using picosecond (ps) laser pulses. The luminescence emission mechanisms and the light decay processes from the self-trapped excitons are also reviewed. Recent results suggest that the blue emission at 2.8 eV, often associated with oxygen vacancies, is related to a transition from unbound conduction levels to the ground singlet state of the STE. The stabilization of small electron polarons by oxygen vacancies and its connection with luminescence emission are discussed in detail. Through ion-beam irradiation experiments, it has recently been established that the electrons associated with the vacancy constitute electron polaron states (Ti3+) trapped in the close vicinity of the empty oxygen sites. These experimental results have allowed for the optical identification of the oxygen vacancy center through a red luminescence emission centered at 2.0 eV. Ab-initio calculations have provided strong support for those experimental findings. Finally, the use of Cr-doped STO has offered a way to monitor the interplay between the chromium centers and oxygen vacancies as trapping sites for the electron and hole partners resulting from the electronic excitation

    Fast and Accurate Simulation of Multithreaded Sparse Linear Algebra Solvers

    Get PDF
    International audienceThe ever growing complexity and scale of parallel architectures imposes to rewrite classical monolithic HPC scientific applications and libraries as their portability and performance optimization only comes at a prohibitive cost. There is thus a recent and general trend in using instead a modular approach where numerical algorithms are written at a high level independently of the hardware architecture as Directed Acyclic Graphs (DAG) of tasks. A task-based runtime system then dynamically schedules the resulting DAG on the different computing resources, automatically taking care of data movement and taking into account the possible speed heterogeneity and variability. Evaluating the performance of such complex and dynamic systems is extremely challenging especially for irregular codes. In this article, we explain how we crafted a faithful simulation, both in terms of performance and memory usage, of the behavior of qr_mumps, a fully-featured sparse linear algebra library, on multi-core architectures. In our approach, the target high-end machines are calibrated only once to derive sound performance models. These models can then be used at will to quickly predict and study in a reproducible way the performance of such irregular and resource-demanding applications using solely a commodity laptop

    Ion Beam irradiation of copper nitride: electronic vs elastic-collision mechanism

    Full text link
    Copper nitride is a metastable material which results very attractive because of their potential to be used in functional device. Cu3 N easily decomposes into Cu and N2 by annealing [1] or irradiation (electron, ions, laser) [2, 3]. Previous studies carried out in N-rich Cu3 N films irradiated with Cu at 42MeV evidence a very efficient sputtering of N whose yield (5×10 3 atom/ion), for a film with a thickness of just 100 nm, suggest that the origin of the sputtering has an electronic nature. This N depletion was observed to be responsible for new phase formation ( Cu2 O) and pure Cu [4

    Ionoluminescence on α-quartz: mechanisms and modeling

    Get PDF
    Ionoluminescence of α - quartz exhibits two dominant emission bands peaking at 1.9 eV. (NBOHCs) and 2.7 eV (STEs. The evolution of the red emission yield does not show a correlation with the concentrations of neither the NBOHC nor with that of other color centers. The blue emission yield closely follows the amorphization kinetics independently measured by RBS/C spectrometry. A simple theoretical model has been proposed; it assumes that the formation and recombination of STEs are the primary event and both, the light emissions and the lattice structural damage are a consequence this phenomenon. The model leads to several simple mathematical equations that can be used to simulate the IL yields and provide a reasonable fit to experimental kinetic data

    Refractive index changes in amorphous SiO2 (silica) by swift ion irradiation

    Get PDF
    The refractive index changes induced by swift ion-beam irradiation in silica have been measured either by spectroscopic ellipsometry or through the effective indices of the optical modes propagating through the irradiated structure. The optical response has been analyzed by considering an effective homogeneous medium to simulate the nanostructured irradiated system consisting of cylindrical tracks, associated to the ion impacts, embedded into a virgin material. The role of both, irradiation fluence and stopping power, has been investigated. Above a certain electronic stopping power threshold (∌2.5 keV/nm), every ion impact creates an axial region around the trajectory with a fixed refractive index (around n = 1.475) corresponding to a certain structural phase that is independent of stopping power. The results have been compared with previous data measured by means of infrared spectroscopy and small-angle X-ray scattering; possible mechanisms and theoretical models are discussed

    Amorphization kinectics under swift heavy ion irradiation: a cumulative overlapping-track approach

    Get PDF
    A simple illustrative physical model is presented to describe the kinetics of damage and amorphization by swiftheavyions (SHI) in LiNbO3. The model considers that every ion impact generates initially a defective region (halo) and a full amorphous core whose relative size depends on the electronic stopping power. Below a given stopping power threshold only a halo is generated. For increasing fluences the amorphized area grows monotonically via overlapping of a fixed number N of halos. In spite of its simplicity the model, which provides analytical solutions, describes many relevant features of the kinetic behaviour. In particular, it predicts approximate Avrami curves with parameters depending on stopping power in qualitative accordance with experiment that turn into Poisson laws well above the threshold valu
    • 

    corecore