213 research outputs found

    A Parallel Iterative Method for Computing Molecular Absorption Spectra

    Full text link
    We describe a fast parallel iterative method for computing molecular absorption spectra within TDDFT linear response and using the LCAO method. We use a local basis of "dominant products" to parametrize the space of orbital products that occur in the LCAO approach. In this basis, the dynamical polarizability is computed iteratively within an appropriate Krylov subspace. The iterative procedure uses a a matrix-free GMRES method to determine the (interacting) density response. The resulting code is about one order of magnitude faster than our previous full-matrix method. This acceleration makes the speed of our TDDFT code comparable with codes based on Casida's equation. The implementation of our method uses hybrid MPI and OpenMP parallelization in which load balancing and memory access are optimized. To validate our approach and to establish benchmarks, we compute spectra of large molecules on various types of parallel machines. The methods developed here are fairly general and we believe they will find useful applications in molecular physics/chemistry, even for problems that are beyond TDDFT, such as organic semiconductors, particularly in photovoltaics.Comment: 20 pages, 17 figures, 3 table

    Modélisation hautes fréquences en magnétohydrodynamique : aspect magnétique

    Get PDF
    Dans ce papier nous rappelons la modélisation de l'induction magnétique générée par des courants alternatifs de haute fréquence pour le traitement des métaux liquides. A l'aide des méthodes asymptotiques nous construisons des modèles limites lorsque la fréquence du courant imposée est grande

    Numerical approximation of free boundary problem arising in electromagnetic shaping

    Get PDF
    We approximate numerically the section of a column of liquid metal submitted to an external electromagnetic field. The bidimensional model used here leads us to solve by a spectral method a nonlinear boundary problem posed on the exterior of the unit disk to find a convenient conformal mapping. We give accurates bound of the errors obtained during our approximation process. We present some examples of shapes for both exterior and interior shaping

    Pipelining the Fast Multipole Method over a Runtime System

    Get PDF
    Fast Multipole Methods (FMM) are a fundamental operation for the simulation of many physical problems. The high performance design of such methods usually requires to carefully tune the algorithm for both the targeted physics and the hardware. In this paper, we propose a new approach that achieves high performance across architectures. Our method consists of expressing the FMM algorithm as a task flow and employing a state-of-the-art runtime system, StarPU, in order to process the tasks on the different processing units. We carefully design the task flow, the mathematical operators, their Central Processing Unit (CPU) and Graphics Processing Unit (GPU) implementations, as well as scheduling schemes. We compute potentials and forces of 200 million particles in 48.7 seconds on a homogeneous 160 cores SGI Altix UV 100 and of 38 million particles in 13.34 seconds on a heterogeneous 12 cores Intel Nehalem processor enhanced with 3 Nvidia M2090 Fermi GPUs.Comment: No. RR-7981 (2012

    Asymptotic Analysis of Magnetic Induction with High Frequency for Solid Conducteurs

    Get PDF
    In this paper we describe the behaviour both in time and in space of an induction field created by an imposed high frequency alternating current around a solid conductor. To do this, we introduce two time scales and we decompose the induction field in a mean field and an oscillating field. With the help of singular perturbations theory and multiple scales method we obtain two uncoupled models; one for the large time scale and the other for the high frequencies. For a cross section of a solid column of metal, we build the two first terms of the asymptotic expansion of the induction field. Moreover, we justify the classical harmonic approximation used in such configuration. Finally in the case of a cylinder column, we apply the previous results by computing numerically the induction for different values of the frequency

    On some orthogonalization schemes in Tensor Train format

    Full text link
    In the framework of tensor spaces, we consider orthogonalization kernels to generate an orthogonal basis of a tensor subspace from a set of linearly independent tensors. In particular, we experimentally study the loss of orthogonality of six orthogonalization methods, namely Classical and Modified Gram-Schmidt with (CGS2, MGS2) and without (CGS, MGS) re-orthogonalization, the Gram approach, and the Householder transformation. To overcome the curse of dimensionality, we represent tensors with a low-rank approximation using the Tensor Train (TT) formalism. In addition, we introduce recompression steps in the standard algorithm outline through the TT-rounding method at a prescribed accuracy. After describing the structure and properties of the algorithms, we illustrate their loss of orthogonality with numerical experiments. The theoretical bounds from the classical matrix computation round-off analysis, obtained over several decades, seem to be maintained, with the unit round-off replaced by the TT-rounding accuracy. The computational analysis for each orthogonalization kernel in terms of the memory requirements and the computational complexity measured as a function of the number of TT-rounding, which happens to be the most computationally expensive operation, completes the study

    Contributions algorithmiques pour les simulations complexes en physique des matériaux

    Get PDF
    Because computer performance is always increasing, the numerical simulations of physical phenomena become much harder. Where does this complexity come from ? On the one hand, for a higher fidelity of the solution problem, more advanced physical models are introduced ; on the other hand, the size of the discretization is shrinking which leads to very large-scale numerical problems. In addition the computers themselves become more complex, for instance with hierar- chical memories and hierarchical process units. Hence the need to redesign algorithms, numerical schemes and software to make them more efficient and effective on emerging architectures.The work layout in this document is divided in two mainlines. First, the development and the parallel design of the fast multipole method to compute pair-wise interactions. Then the coupling of models/methods and codes in material physics at atomistic scale, and the computational steering of these simulations as well.Avec l’accroissement de la puissance des ordinateurs, les simulations numériques de phénomènes physiques deviennent de plus en plus complexes. Cette complexité provient d’une part, de l’ajout de modèles physiques plus compliqués pour mieux représenter la physique et d’autre part, par une discrétisation plus fine du problème conduisant à des problèmes de grande taille. Une difficulté supplémentaire s’ajoute avec la complexification des ordinateurs notamment par des mémoires et des unités de calculs hiérarchiques. Il est donc nécessaire de redessiner les algorithmiques, les schémas numériques et les logiciels pour les rendre plus efficaces sur ces architectures émergentes. Le travail présenté dans ce document correspond à deux axes de recherche. Le premier porte sur le développement et la parallélisation de la méthode des multipôles rapides pour calculer rapidement des interactions de paires. Le deuxième axe concerne le couplage de méthodes et de codes en physique des matériaux à échelle atomique ainsi que le pilotage de ces simulations
    • …
    corecore