14,241 research outputs found
High-Performance Solvers for Dense Hermitian Eigenproblems
We introduce a new collection of solvers - subsequently called EleMRRR - for
large-scale dense Hermitian eigenproblems. EleMRRR solves various types of
problems: generalized, standard, and tridiagonal eigenproblems. Among these,
the last is of particular importance as it is a solver on its own right, as
well as the computational kernel for the first two; we present a fast and
scalable tridiagonal solver based on the Algorithm of Multiple Relatively
Robust Representations - referred to as PMRRR. Like the other EleMRRR solvers,
PMRRR is part of the freely available Elemental library, and is designed to
fully support both message-passing (MPI) and multithreading parallelism (SMP).
As a result, the solvers can equally be used in pure MPI or in hybrid MPI-SMP
fashion. We conducted a thorough performance study of EleMRRR and ScaLAPACK's
solvers on two supercomputers. Such a study, performed with up to 8,192 cores,
provides precise guidelines to assemble the fastest solver within the ScaLAPACK
framework; it also indicates that EleMRRR outperforms even the fastest solvers
built from ScaLAPACK's components
A spectral scheme for Kohn-Sham density functional theory of clusters
Starting from the observation that one of the most successful methods for
solving the Kohn-Sham equations for periodic systems -- the plane-wave method
-- is a spectral method based on eigenfunction expansion, we formulate a
spectral method designed towards solving the Kohn-Sham equations for clusters.
This allows for efficient calculation of the electronic structure of clusters
(and molecules) with high accuracy and systematic convergence properties
without the need for any artificial periodicity. The basis functions in this
method form a complete orthonormal set and are expressible in terms of
spherical harmonics and spherical Bessel functions. Computation of the occupied
eigenstates of the discretized Kohn-Sham Hamiltonian is carried out using a
combination of preconditioned block eigensolvers and Chebyshev polynomial
filter accelerated subspace iterations. Several algorithmic and computational
aspects of the method, including computation of the electrostatics terms and
parallelization are discussed. We have implemented these methods and algorithms
into an efficient and reliable package called ClusterES (Cluster Electronic
Structure). A variety of benchmark calculations employing local and non-local
pseudopotentials are carried out using our package and the results are compared
to the literature. Convergence properties of the basis set are discussed
through numerical examples. Computations involving large systems that contain
thousands of electrons are demonstrated to highlight the efficacy of our
methodology. The use of our method to study clusters with arbitrary point group
symmetries is briefly discussed.Comment: Manuscript submitted (with revisions) to Journal of Computational
Physic
Research and Education in Computational Science and Engineering
Over the past two decades the field of computational science and engineering
(CSE) has penetrated both basic and applied research in academia, industry, and
laboratories to advance discovery, optimize systems, support decision-makers,
and educate the scientific and engineering workforce. Informed by centuries of
theory and experiment, CSE performs computational experiments to answer
questions that neither theory nor experiment alone is equipped to answer. CSE
provides scientists and engineers of all persuasions with algorithmic
inventions and software systems that transcend disciplines and scales. Carried
on a wave of digital technology, CSE brings the power of parallelism to bear on
troves of data. Mathematics-based advanced computing has become a prevalent
means of discovery and innovation in essentially all areas of science,
engineering, technology, and society; and the CSE community is at the core of
this transformation. However, a combination of disruptive
developments---including the architectural complexity of extreme-scale
computing, the data revolution that engulfs the planet, and the specialization
required to follow the applications to new frontiers---is redefining the scope
and reach of the CSE endeavor. This report describes the rapid expansion of CSE
and the challenges to sustaining its bold advances. The report also presents
strategies and directions for CSE research and education for the next decade.Comment: Major revision, to appear in SIAM Revie
A direct solver with O(N) complexity for variable coefficient elliptic PDEs discretized via a high-order composite spectral collocation method
A numerical method for solving elliptic PDEs with variable coefficients on
two-dimensional domains is presented. The method is based on high-order
composite spectral approximations and is designed for problems with smooth
solutions. The resulting system of linear equations is solved using a direct
(as opposed to iterative) solver that has optimal O(N) complexity for all
stages of the computation when applied to problems with non-oscillatory
solutions such as the Laplace and the Stokes equations. Numerical examples
demonstrate that the scheme is capable of computing solutions with relative
accuracy of or better, even for challenging problems such as highly
oscillatory Helmholtz problems and convection-dominated convection diffusion
equations. In terms of speed, it is demonstrated that a problem with a
non-oscillatory solution that was discretized using nodes was solved
in 115 minutes on a personal work-station with two quad-core 3.3GHz CPUs. Since
the solver is direct, and the "solution operator" fits in RAM, any solves
beyond the first are very fast. In the example with unknowns, solves
require only 30 seconds.Comment: arXiv admin note: text overlap with arXiv:1302.599
- …