5,714 research outputs found
Efficient solution of parabolic equations by Krylov approximation methods
Numerical techniques for solving parabolic equations by the method of lines is addressed. The main motivation for the proposed approach is the possibility of exploiting a high degree of parallelism in a simple manner. The basic idea of the method is to approximate the action of the evolution operator on a given state vector by means of a projection process onto a Krylov subspace. Thus, the resulting approximation consists of applying an evolution operator of a very small dimension to a known vector which is, in turn, computed accurately by exploiting well-known rational approximations to the exponential. Because the rational approximation is only applied to a small matrix, the only operations required with the original large matrix are matrix-by-vector multiplications, and as a result the algorithm can easily be parallelized and vectorized. Some relevant approximation and stability issues are discussed. We present some numerical experiments with the method and compare its performance with a few explicit and implicit algorithms
A Time Dependent Multi-Determinant approach to nuclear dynamics
We study a multi-determinant approach to the time evolution of the nuclear
wave functions (TDMD). We employ the Dirac variational principle and use as
anzatz for the nuclear wave-function a linear combination of Slater
determinants and derive the equations of motion. We demonstrate explicitly that
the norm of the wave function and the energy are conserved during the time
evolution. This approach is a direct generalization of the time dependent
Hartree-Fock method. We apply this approach to a case study of using
the N3LO interaction renormalized to 4 major harmonic oscillator shells. We
solve the TDMD equations of motion using Krylov subspace methods of Lanczos
type. We discuss as an application the isoscalar monopole strength function.Comment: 38 pages, additional calculations included. Accepted for publication,
Int. J. of Mod. Phys.
Similarity-Aware Spectral Sparsification by Edge Filtering
In recent years, spectral graph sparsification techniques that can compute
ultra-sparse graph proxies have been extensively studied for accelerating
various numerical and graph-related applications. Prior nearly-linear-time
spectral sparsification methods first extract low-stretch spanning tree from
the original graph to form the backbone of the sparsifier, and then recover
small portions of spectrally-critical off-tree edges to the spanning tree to
significantly improve the approximation quality. However, it is not clear how
many off-tree edges should be recovered for achieving a desired spectral
similarity level within the sparsifier. Motivated by recent graph signal
processing techniques, this paper proposes a similarity-aware spectral graph
sparsification framework that leverages efficient spectral off-tree edge
embedding and filtering schemes to construct spectral sparsifiers with
guaranteed spectral similarity (relative condition number) level. An iterative
graph densification scheme is introduced to facilitate efficient and effective
filtering of off-tree edges for highly ill-conditioned problems. The proposed
method has been validated using various kinds of graphs obtained from public
domain sparse matrix collections relevant to VLSI CAD, finite element analysis,
as well as social and data networks frequently studied in many machine learning
and data mining applications
Self-adaptive Multiprecision Preconditioners on Multicore and Manycore Architectures
Abstract. Based on the premise that preconditioners needed for scien-tific computing are not only required to be robust in the numerical sense, but also scalable for up to thousands of light-weight cores, we argue that this two-fold goal is achieved for the recently developed self-adaptive multi-elimination preconditioner. For this purpose, we revise the under-lying idea and analyze the performance of implementations realized in the PARALUTION and MAGMA open-source software libraries on GPU architectures (using either CUDA or OpenCL), Intel’s Many Integrated Core Architecture, and Intel’s Sandy Bridge processor. The comparison with other well-established preconditioners like multi-coloured Gauss-Seidel, ILU(0) and multi-colored ILU(0), shows that the twofold goal of a numerically stable cross-platform performant algorithm is achieved.
Improved numerical methods for infinite spin chains with long-range interactions
We present several improvements of the infinite matrix product state (iMPS)
algorithm for finding ground states of one-dimensional quantum systems with
long-range interactions. As a main new ingredient we introduce the superposed
multi-optimization (SMO) method, which allows an efficient optimization of
exponentially many MPS of different length at different sites all in one step.
Hereby the algorithm becomes protected against position dependent effects as
caused by spontaneously broken translational invariance. So far, these have
been a major obstacle to convergence for the iMPS algorithm if no prior
knowledge of the systems translational symmetry was accessible. Further, we
investigate some more general methods to speed up calculations and improve
convergence, which might be partially interesting in a much broader context,
too. As a more special problem, we also look into translational invariant
states close to an invariance braking phase transition and show how to avoid
convergence into wrong local minima for such systems. Finally, we apply the new
methods to polar bosons with long-range interactions. We calculate several
detailed Devil's Staircases with the corresponding phase diagrams and
investigate some supersolid properties.Comment: Main text: 17 pages plus references, 8 figures. Supplementary info: 6
pages. v2: improved presentation and more results adde
Linear response strength functions with iterative Arnoldi diagonalization
We report on an implementation of a new method to calculate RPA strength
functions with iterative non-hermitian Arnoldi diagonalization method, which
does not explicitly calculate and store the RPA matrix. We discuss the
treatment of spurious modes, numerical stability, and how the method scales as
the used model space is enlarged. We perform the particle-hole RPA benchmark
calculations for double magic nucleus 132Sn and compare the resulting
electromagnetic strength functions against those obtained within the standard
RPA.Comment: 9 RevTeX pages, 11 figures, submitted to Physical Review
Nonequilibrium electron transport using the density matrix renormalization group
We extended the Density Matrix Renormalization Group method to study the real
time dynamics of interacting one dimensional spinless Fermi systems by applying
the full time evolution operator to an initial state. As an example we describe
the propagation of a density excitation in an interacting clean system and the
transport through an interacting nano structure
Targeted Excited State Algorithms
To overcome the limitations of the traditional state-averaging approaches in
excited state calculations, where one solves for and represents all states
between the ground state and excited state of interest, we have investigated a
number of new excited state algorithms. Building on the work of van der Vorst
and Sleijpen (SIAM J. Matrix Anal. Appl., 17, 401 (1996)), we have implemented
Harmonic Davidson and State-Averaged Harmonic Davidson algorithms within the
context of the Density Matrix Renormalization Group (DMRG). We have assessed
their accuracy and stability of convergence in complete active space DMRG
calculations on the low-lying excited states in the acenes ranging from
naphthalene to pentacene. We find that both algorithms offer increased accuracy
over the traditional State-Averaged Davidson approach, and in particular, the
State-Averaged Harmonic Davidson algorithm offers an optimal combination of
accuracy and stability in convergence
Computing and deflating eigenvalues while solving multiple right hand side linear systems in Quantum Chromodynamics
We present a new algorithm that computes eigenvalues and eigenvectors of a
Hermitian positive definite matrix while solving a linear system of equations
with Conjugate Gradient (CG). Traditionally, all the CG iteration vectors could
be saved and recombined through the eigenvectors of the tridiagonal projection
matrix, which is equivalent theoretically to unrestarted Lanczos. Our algorithm
capitalizes on the iteration vectors produced by CG to update only a small
window of vectors that approximate the eigenvectors. While this window is
restarted in a locally optimal way, the CG algorithm for the linear system is
unaffected. Yet, in all our experiments, this small window converges to the
required eigenvectors at a rate identical to unrestarted Lanczos. After the
solution of the linear system, eigenvectors that have not accurately converged
can be improved in an incremental fashion by solving additional linear systems.
In this case, eigenvectors identified in earlier systems can be used to
deflate, and thus accelerate, the convergence of subsequent systems. We have
used this algorithm with excellent results in lattice QCD applications, where
hundreds of right hand sides may be needed. Specifically, about 70 eigenvectors
are obtained to full accuracy after solving 24 right hand sides. Deflating
these from the large number of subsequent right hand sides removes the dreaded
critical slowdown, where the conditioning of the matrix increases as the quark
mass reaches a critical value. Our experiments show almost a constant number of
iterations for our method, regardless of quark mass, and speedups of 8 over
original CG for light quark masses.Comment: 22 pages, 26 eps figure
- …