15,875 research outputs found

    Comparision of direct and Fourier space techniques in time-dependent density functional theory

    Full text link
    Several techniques have appeared in the literatuare to solve the equations of time-dependent density functional theory. We compare the efficiency of different methods based on mesh representations of the wave function (direct and Fourier space), taking as a test case the calculation of the surface plasmon in the cluster Na_8. For smaller systems, the methods have comparable efficiency. For large systems the direct time method has a decided advantage in computer storage requirements. It is also more economical on arithmetic operations, but is not as suited for parallel computing as the methods based on a frequency representation.Comment: 20 pages, TeX (or Latex, etc), 5 tables and no figures; revised version with new abstrac

    Reproducibility, accuracy and performance of the Feltor code and library on parallel computer architectures

    Get PDF
    Feltor is a modular and free scientific software package. It allows developing platform independent code that runs on a variety of parallel computer architectures ranging from laptop CPUs to multi-GPU distributed memory systems. Feltor consists of both a numerical library and a collection of application codes built on top of the library. Its main target are two- and three-dimensional drift- and gyro-fluid simulations with discontinuous Galerkin methods as the main numerical discretization technique. We observe that numerical simulations of a recently developed gyro-fluid model produce non-deterministic results in parallel computations. First, we show how we restore accuracy and bitwise reproducibility algorithmically and programmatically. In particular, we adopt an implementation of the exactly rounded dot product based on long accumulators, which avoids accuracy losses especially in parallel applications. However, reproducibility and accuracy alone fail to indicate correct simulation behaviour. In fact, in the physical model slightly different initial conditions lead to vastly different end states. This behaviour translates to its numerical representation. Pointwise convergence, even in principle, becomes impossible for long simulation times. In a second part, we explore important performance tuning considerations. We identify latency and memory bandwidth as the main performance indicators of our routines. Based on these, we propose a parallel performance model that predicts the execution time of algorithms implemented in Feltor and test our model on a selection of parallel hardware architectures. We are able to predict the execution time with a relative error of less than 25% for problem sizes between 0.1 and 1000 MB. Finally, we find that the product of latency and bandwidth gives a minimum array size per compute node to achieve a scaling efficiency above 50% (both strong and weak)

    Towards Solving the Navier-Stokes Equation on Quantum Computers

    Get PDF
    In this paper, we explore the suitability of upcoming novel computing technologies, in particular adiabatic annealing based quantum computers, to solve fluid dynamics problems that form a critical component of several science and engineering applications. We start with simple flows with well-studied flow properties, and provide a framework to convert such systems to a form amenable for deployment on such quantum annealers. We analyze the solutions obtained both qualitatively and quantitatively as well as the sensitivities of the various solution selection schemes on the obtained solution

    Study of a navigation and traffic control technique employing satellites. Volume 3 - User hardware Interim report

    Get PDF
    User hardware configurations and requirements for navigation and air traffic control technique using satellite

    Numerical proof of stability of viscous shock profiles

    Full text link
    We carry out the first rigorous numerical proof based on Evans function computations of stability of viscous shock profiles, for the system of isentropic gas dynamics with monatomic equation of state. We treat a selection of shock strengths ranging from the lower stability boundary of Mach number ≈1.86\approx 1.86 , below which profiles are known by energy estimates to be stable, to the upper stability boundary of ≈1669\approx 1669, above which profiles are expected to be provable by rigorous asymptotic analysis to be stable. These results open the possibilities of: (i) automatic rigorous verification of stability or instability of individual shocks of general systems, and (ii) rigorous proof of stability of all shocks of particular systems.Comment: 13 pages, 10 figure

    Efficient computation of Hamiltonian matrix elements between non-orthogonal Slater determinants

    Get PDF
    We present an efficient numerical method for computing Hamiltonian matrix elements between non-orthogonal Slater determinants, focusing on the most time-consuming component of the calculation that involves a sparse array. In the usual case where many matrix elements should be calculated, this computation can be transformed into a multiplication of dense matrices. It is demonstrated that the present method based on the matrix-matrix multiplication attains ∼\sim80% of the theoretical peak performance measured on systems equipped with modern microprocessors, a factor of 5-10 better than the normal method using indirectly indexed arrays to treat a sparse array. The reason for such different performances is discussed from the viewpoint of memory access.Comment: 8 pages, 3 figure

    A GPGPU based program to solve the TDSE in intense laser fields through the finite difference approach

    Full text link
    We present a General-purpose computing on graphics processing units (GPGPU) based computational program and framework for the electronic dynamics of atomic systems under intense laser fields. We present our results using the case of hydrogen, however the code is trivially extensible to tackle problems within the single-active electron (SAE) approximation. Building on our previous work, we introduce the first available GPGPU based implementation of the Taylor, Runge-Kutta and Lanczos based methods created with strong field ab-initio simulations specifically in mind; CLTDSE. The code makes use of finite difference methods and the OpenCL framework for GPU acceleration. The specific example system used is the classic test system; Hydrogen. After introducing the standard theory, and specific quantities which are calculated, the code, including installation and usage, is discussed in-depth. This is followed by some examples and a short benchmark between an 8 hardware thread (i.e logical core) Intel Xeon CPU and an AMD 6970 GPU, where the parallel algorithm runs 10 times faster on the GPU than the CPU.Comment: 12 figure

    Cloud Computing - Architecture and Applications

    Full text link
    In the era of Internet of Things and with the explosive worldwide growth of electronic data volume, and associated need of processing, analysis, and storage of such humongous volume of data, it has now become mandatory to exploit the power of massively parallel architecture for fast computation. Cloud computing provides a cheap source of such computing framework for large volume of data for real-time applications. It is, therefore, not surprising to see that cloud computing has become a buzzword in the computing fraternity over the last decade. This book presents some critical applications in cloud frameworks along with some innovation design of algorithms and architecture for deployment in cloud environment. It is a valuable source of knowledge for researchers, engineers, practitioners, and graduate and doctoral students working in the field of cloud computing. It will also be useful for faculty members of graduate schools and universities.Comment: Edited Volume published by Intech Publishers, Croatia, June 2017. 138 pages. ISBN 978-953-51-3244-8, Print ISBN 978-953-51-3243-1. Link: https://www.intechopen.com/books/cloud-computing-architecture-and-application
    • …
    corecore