Search CORE

15,875 research outputs found

Comparision of direct and Fourier space techniques in time-dependent density functional theory

Author: Bertsch G. F.
Rubio Angel
Yabana K.
Publication venue
Publication date: 11/04/2000
Field of study

Several techniques have appeared in the literatuare to solve the equations of time-dependent density functional theory. We compare the efficiency of different methods based on mesh representations of the wave function (direct and Fourier space), taking as a test case the calculation of the surface plasmon in the cluster Na_8. For smaller systems, the methods have comparable efficiency. For large systems the direct time method has a decided advantage in computer storage requirements. It is also more economical on arithmetic operations, but is not as suited for parallel computing as the methods based on a frequency representation.Comment: 20 pages, TeX (or Latex, etc), 5 tables and no figures; revised version with new abstrac

arXiv.org e-Print Archive

Solving the global atmospheric equations through heterogeneous reconfigurable platforms

Author: Fu H
Gan L
Huang X
Luk W
Xue W
Yang C
Yang G
Zhang Y
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/03/2014
Field of study

Spiral - Imperial College Digital Repository

Recommended from our members

Towards Solving the Navier-Stokes Equation on Quantum Computers

Author: Banerjee Tirtha
Karra Satish
Nadiga Balasubramanya
Ray Navamita
Publication venue: eScholarship, University of California
Publication date: 16/04/2019
Field of study

In this paper, we explore the suitability of upcoming novel computing technologies, in particular adiabatic annealing based quantum computers, to solve fluid dynamics problems that form a critical component of several science and engineering applications. We start with simple flows with well-studied flow properties, and provide a framework to convert such systems to a form amenable for deployment on such quantum annealers. We analyze the solutions obtained both qualitatively and quantitatively as well as the sensitivities of the various solution selection schemes on the obtained solution

eScholarship - University of California

Reproducibility, accuracy and performance of the Feltor code and library on parallel computer architectures

Author: Einkemmer Lukas
Gutierrez-Milla Albert
Held Markus
Iakymchuk Roman
Saez Xavier
Wiesenberger Matthias
Publication venue: 'Elsevier BV'
Publication date: 03/11/2018
Field of study

Feltor is a modular and free scientific software package. It allows developing platform independent code that runs on a variety of parallel computer architectures ranging from laptop CPUs to multi-GPU distributed memory systems. Feltor consists of both a numerical library and a collection of application codes built on top of the library. Its main target are two- and three-dimensional drift- and gyro-fluid simulations with discontinuous Galerkin methods as the main numerical discretization technique. We observe that numerical simulations of a recently developed gyro-fluid model produce non-deterministic results in parallel computations. First, we show how we restore accuracy and bitwise reproducibility algorithmically and programmatically. In particular, we adopt an implementation of the exactly rounded dot product based on long accumulators, which avoids accuracy losses especially in parallel applications. However, reproducibility and accuracy alone fail to indicate correct simulation behaviour. In fact, in the physical model slightly different initial conditions lead to vastly different end states. This behaviour translates to its numerical representation. Pointwise convergence, even in principle, becomes impossible for long simulation times. In a second part, we explore important performance tuning considerations. We identify latency and memory bandwidth as the main performance indicators of our routines. Based on these, we propose a parallel performance model that predicts the execution time of algorithms implemented in Feltor and test our model on a selection of parallel hardware architectures. We are able to predict the execution time with a relative error of less than 25% for problem sizes between 0.1 and 1000 MB. Finally, we find that the product of latency and bandwidth gives a minimum array size per compute node to achieve a scaling efficiency above 50% (both strong and weak)

arXiv.org e-Print Archive

Towards Solving the Navier-Stokes Equation on Quantum Computers

Author: Banerjee Tirtha
Karra Satish
Nadiga Balasubramanya
Ray Navamita
Publication venue
Publication date: 16/04/2019
Field of study

arXiv.org e-Print Archive

eScholarship - University of California

Study of a navigation and traffic control technique employing satellites. Volume 3 - User hardware Interim report

Author: Estersohn N.
Garabedian A.
Publication venue
Publication date
Field of study

User hardware configurations and requirements for navigation and air traffic control technique using satellite

Numerical proof of stability of viscous shock profiles

Author: Barker Blake
Zumbrun Kevin
Publication venue
Publication date: 05/01/2016
Field of study

We carry out the first rigorous numerical proof based on Evans function computations of stability of viscous shock profiles, for the system of isentropic gas dynamics with monatomic equation of state. We treat a selection of shock strengths ranging from the lower stability boundary of Mach number

\approx 1.86

, below which profiles are known by energy estimates to be stable, to the upper stability boundary of

\approx 1669

, above which profiles are expected to be provable by rigorous asymptotic analysis to be stable. These results open the possibilities of: (i) automatic rigorous verification of stability or instability of individual shocks of general systems, and (ii) rigorous proof of stability of all shocks of particular systems.Comment: 13 pages, 10 figure

arXiv.org e-Print Archive

Efficient computation of Hamiltonian matrix elements between non-orthogonal Slater determinants

Author: Bender
Bender
Bohr
Dongarra
Dongarra
Dongarra
Dongarra
Griffin
Hill
Koch
Lawson
Morgon
Noritaka Shimizu
Otsuka
Puddu
Ring
Robledo
Sabbey
Sarkar
Schmid
Scuseria
Shimizu
Takaharu Otsuka
Takashi Abe
Thouless
Tomita
Wadleigh
Wick
Yutaka Utsuno
Publication venue: 'Elsevier BV'
Publication date: 18/09/2012
Field of study

We present an efficient numerical method for computing Hamiltonian matrix elements between non-orthogonal Slater determinants, focusing on the most time-consuming component of the calculation that involves a sparse array. In the usual case where many matrix elements should be calculated, this computation can be transformed into a multiplication of dense matrices. It is demonstrated that the present method based on the matrix-matrix multiplication attains

\sim

80% of the theoretical peak performance measured on systems equipped with modern microprocessors, a factor of 5-10 better than the normal method using indirectly indexed arrays to treat a sparse array. The reason for such different performances is discussed from the viewpoint of memory access.Comment: 8 pages, 3 figure

arXiv.org e-Print Archive

A GPGPU based program to solve the TDSE in intense laser fields through the finite difference approach

Author: Broin Cathal Ó
Nikolopoulos L. A. A
Publication venue: 'Elsevier BV'
Publication date: 08/08/2013
Field of study

We present a General-purpose computing on graphics processing units (GPGPU) based computational program and framework for the electronic dynamics of atomic systems under intense laser fields. We present our results using the case of hydrogen, however the code is trivially extensible to tackle problems within the single-active electron (SAE) approximation. Building on our previous work, we introduce the first available GPGPU based implementation of the Taylor, Runge-Kutta and Lanczos based methods created with strong field ab-initio simulations specifically in mind; CLTDSE. The code makes use of finite difference methods and the OpenCL framework for GPU acceleration. The specific example system used is the classic test system; Hydrogen. After introducing the standard theory, and specific quantities which are calculated, the code, including installation and usage, is discussed in-depth. This is followed by some examples and a short benchmark between an 8 hardware thread (i.e logical core) Intel Xeon CPU and an AMD 6970 GPU, where the parallel algorithm runs 10 times faster on the GPU than the CPU.Comment: 12 figure

arXiv.org e-Print Archive

Cloud Computing - Architecture and Applications

Author: Alasaad Amr
Alfakhri Abdullah
Alshebeili Saleh
Andreev Sergey
Ashraf Muhammad Ahmad
Florea Roman
Koucheryavy Yevgeni
Long Yun
Ometov Aleksandr
Sen Jaydip
Sethi Waleed Tariq
Surak Adam
Wang Jian
Wang Xiaoying
Yang Mengqin
Zhang Guojing
Zhao Shanrong
Publication venue: 'IntechOpen'
Publication date: 29/07/2017
Field of study

In the era of Internet of Things and with the explosive worldwide growth of electronic data volume, and associated need of processing, analysis, and storage of such humongous volume of data, it has now become mandatory to exploit the power of massively parallel architecture for fast computation. Cloud computing provides a cheap source of such computing framework for large volume of data for real-time applications. It is, therefore, not surprising to see that cloud computing has become a buzzword in the computing fraternity over the last decade. This book presents some critical applications in cloud frameworks along with some innovation design of algorithms and architecture for deployment in cloud environment. It is a valuable source of knowledge for researchers, engineers, practitioners, and graduate and doctoral students working in the field of cloud computing. It will also be useful for faculty members of graduate schools and universities.Comment: Edited Volume published by Intech Publishers, Croatia, June 2017. 138 pages. ISBN 978-953-51-3244-8, Print ISBN 978-953-51-3243-1. Link: https://www.intechopen.com/books/cloud-computing-architecture-and-application

arXiv.org e-Print Archive