313 research outputs found
An overview of block Gram-Schmidt methods and their stability properties
Block Gram-Schmidt algorithms serve as essential kernels in many scientific
computing applications, but for many commonly used variants, a rigorous
treatment of their stability properties remains open. This survey provides a
comprehensive categorization of block Gram-Schmidt algorithms, particularly
those used in Krylov subspace methods to build orthonormal bases one block
vector at a time. All known stability results are assembled, and new results
are summarized or conjectured for important communication-reducing variants.
Additionally, new block versions of low-synchronization variants are derived,
and their efficacy and stability are demonstrated for a wide range of
challenging examples. Low-synchronization variants appear remarkably stable for
s-step-like matrices built with Newton polynomials, pointing towards a new
stable and efficient backbone for Krylov subspace methods. Numerical examples
are computed with a versatile MATLAB package hosted at
https://github.com/katlund/BlockStab, and scripts for reproducing all results
in the paper are provided. Block Gram-Schmidt implementations in popular
software packages are discussed, along with a number of open problems. An
appendix containing all algorithms type-set in a uniform fashion is provided.Comment: 42 pages, 5 tables, 17 figures, 20 algorithm
Benefits from using mixed precision computations in the ELPA-AEO and ESSEX-II eigensolver projects
We first briefly report on the status and recent achievements of the ELPA-AEO
(Eigenvalue Solvers for Petaflop Applications - Algorithmic Extensions and
Optimizations) and ESSEX II (Equipping Sparse Solvers for Exascale) projects.
In both collaboratory efforts, scientists from the application areas,
mathematicians, and computer scientists work together to develop and make
available efficient highly parallel methods for the solution of eigenvalue
problems. Then we focus on a topic addressed in both projects, the use of mixed
precision computations to enhance efficiency. We give a more detailed
description of our approaches for benefiting from either lower or higher
precision in three selected contexts and of the results thus obtained
ELSI: A Unified Software Interface for Kohn-Sham Electronic Structure Solvers
Solving the electronic structure from a generalized or standard eigenproblem
is often the bottleneck in large scale calculations based on Kohn-Sham
density-functional theory. This problem must be addressed by essentially all
current electronic structure codes, based on similar matrix expressions, and by
high-performance computation. We here present a unified software interface,
ELSI, to access different strategies that address the Kohn-Sham eigenvalue
problem. Currently supported algorithms include the dense generalized
eigensolver library ELPA, the orbital minimization method implemented in
libOMM, and the pole expansion and selected inversion (PEXSI) approach with
lower computational complexity for semilocal density functionals. The ELSI
interface aims to simplify the implementation and optimal use of the different
strategies, by offering (a) a unified software framework designed for the
electronic structure solvers in Kohn-Sham density-functional theory; (b)
reasonable default parameters for a chosen solver; (c) automatic conversion
between input and internal working matrix formats, and in the future (d)
recommendation of the optimal solver depending on the specific problem.
Comparative benchmarks are shown for system sizes up to 11,520 atoms (172,800
basis functions) on distributed memory supercomputing architectures.Comment: 55 pages, 14 figures, 2 table
A spectral scheme for Kohn-Sham density functional theory of clusters
Starting from the observation that one of the most successful methods for
solving the Kohn-Sham equations for periodic systems -- the plane-wave method
-- is a spectral method based on eigenfunction expansion, we formulate a
spectral method designed towards solving the Kohn-Sham equations for clusters.
This allows for efficient calculation of the electronic structure of clusters
(and molecules) with high accuracy and systematic convergence properties
without the need for any artificial periodicity. The basis functions in this
method form a complete orthonormal set and are expressible in terms of
spherical harmonics and spherical Bessel functions. Computation of the occupied
eigenstates of the discretized Kohn-Sham Hamiltonian is carried out using a
combination of preconditioned block eigensolvers and Chebyshev polynomial
filter accelerated subspace iterations. Several algorithmic and computational
aspects of the method, including computation of the electrostatics terms and
parallelization are discussed. We have implemented these methods and algorithms
into an efficient and reliable package called ClusterES (Cluster Electronic
Structure). A variety of benchmark calculations employing local and non-local
pseudopotentials are carried out using our package and the results are compared
to the literature. Convergence properties of the basis set are discussed
through numerical examples. Computations involving large systems that contain
thousands of electrons are demonstrated to highlight the efficacy of our
methodology. The use of our method to study clusters with arbitrary point group
symmetries is briefly discussed.Comment: Manuscript submitted (with revisions) to Journal of Computational
Physic
Explicit schemes for time propagating many-body wavefunctions
Accurate theoretical data on many time-dependent processes in atomic and
molecular physics and in chemistry require the direct numerical solution of the
time-dependent Schr\"odinger equation, thereby motivating the development of
very efficient time propagators. These usually involve the solution of very
large systems of first order differential equations that are characterized by a
high degree of stiffness. We analyze and compare the performance of the
explicit one-step algorithms of Fatunla and Arnoldi. Both algorithms have
exactly the same stability function, therefore sharing the same stability
properties that turn out to be optimum. Their respective accuracy however
differs significantly and depends on the physical situation involved. In order
to test this accuracy, we use a predictor-corrector scheme in which the
predictor is either Fatunla's or Arnoldi's algorithm and the corrector, a fully
implicit four-stage Radau IIA method of order 7. We consider two physical
processes. The first one is the ionization of an atomic system by a short and
intense electromagnetic pulse; the atomic systems include a one-dimensional
Gaussian model potential as well as atomic hydrogen and helium, both in full
dimensionality. The second process is the decoherence of two-electron quantum
states when a time independent perturbation is applied to a planar two-electron
quantum dot where both electrons are confined in an anharmonic potential. Even
though the Hamiltonian of this system is time independent the corresponding
differential equation shows a striking stiffness. For the one-dimensional
Gaussian potential we discuss in detail the possibility of monitoring the time
step for both explicit algorithms. In the other physical situations that are
much more demanding in term of computations, we show that the accuracy of both
algorithms depends strongly on the degree of stiffness of the problem.Comment: 24 pages, 14 Figure
Optimization Of Zonal Wavefront Estimation And Curvature Measurements
Optical testing in adverse environments, ophthalmology and applications where characterization by curvature is leveraged all have a common goal: accurately estimate wavefront shape. This dissertation investigates wavefront sensing techniques as applied to optical testing based on gradient and curvature measurements. Wavefront sensing involves the ability to accurately estimate shape over any aperture geometry, which requires establishing a sampling grid and estimation scheme, quantifying estimation errors caused by measurement noise propagation, and designing an instrument with sufficient accuracy and sensitivity for the application. Starting with gradient-based wavefront sensing, a zonal least-squares wavefront estimation algorithm for any irregular pupil shape and size is presented, for which the normal matrix equation sets share a pre-defined matrix. A GerchbergâSaxton iterative method is employed to reduce the deviation errors in the estimated wavefront caused by the pre-defined matrix across discontinuous boundary. The results show that the RMS deviation error of the estimated wavefront from the original wavefront can be less than λ/130~ λ/150 (for λ equals 632.8nm) after about twelve iterations and less than λ/100 after as few as four iterations. The presented approach to handling irregular pupil shapes applies equally well to wavefront estimation from curvature data. A defining characteristic for a wavefront estimation algorithm is its error propagation behavior. The error propagation coefficient can be formulated as a function of the eigenvalues of the wavefront estimation-related matrices, and such functions are established for each of the basic estimation geometries (i.e. Fried, Hudgin and Southwell) with a serial numbering scheme, where a square sampling grid array is sequentially indexed row by row. The results show that with the wavefront piston-value fixed, the odd-number grid sizes yield lower error propagation than the even-number grid sizes for all geometries. The Fried geometry either allows sub-sized wavefront estimations within the testing domain or yields a two-rank deficient estimation matrix over the full aperture; but the latter usually suffers from high error propagation and the waffle mode problem. Hudgin geometry offers an error propagator between those of the Southwell and the Fried geometries. For both wavefront gradient-based and wavefront difference-based estimations, the Southwell geometry is shown to offer the lowest error propagation with the minimum-norm least-squares solution. Nollâs theoretical result, which was extensively used as a reference in the previous literature for error propagation estimate, corresponds to the Southwell geometry with an odd-number grid size. For curvature-based wavefront sensing, a concept for a differential Shack-Hartmann (DSH) curvature sensor is proposed. This curvature sensor is derived from the basic Shack-Hartmann sensor with the collimated beam split into three output channels, along each of which a lenslet array is located. Three Hartmann grid arrays are generated by three lenslet arrays. Two of the lenslets shear in two perpendicular directions relative to the third one. By quantitatively comparing the Shack-Hartmann grid coordinates of the three channels, the differentials of the wavefront slope at each Shack-Hartmann grid point can be obtained, so the Laplacian curvatures and twist terms will be available. The acquisition of the twist terms using a Hartmann-based sensor allows us to uniquely determine the principal curvatures and directions more accurately than prior methods. Measurement of local curvatures as opposed to slopes is unique because curvature is intrinsic to the wavefront under test, and it is an absolute as opposed to a relative measurement. A zonal least-squares-based wavefront estimation algorithm was developed to estimate the wavefront shape from the Laplacian curvature data, and validated. An implementation of the DSH curvature sensor is proposed and an experimental system for this implementation was initiated. The DSH curvature sensor shares the important features of both the Shack-Hartmann slope sensor and Roddierâs curvature sensor. It is a two-dimensional parallel curvature sensor. Because it is a curvature sensor, it provides absolute measurements which are thus insensitive to vibrations, tip/tilts, and whole body movements. Because it is a two-dimensional sensor, it does not suffer from other sources of errors, such as scanning noise. Combined with sufficient sampling and a zonal wavefront estimation algorithm, both low and mid frequencies of the wavefront may be recovered. Notice that the DSH curvature sensor operates at the pupil of the system under test, therefore the difficulty associated with operation close to the caustic zone is avoided. Finally, the DSH-curvature-sensor-based wavefront estimation does not suffer from the 2Ï-ambiguity problem, so potentially both small and large aberrations may be measured
- âŠ