40 research outputs found
A multishift, multipole rational QZ method with aggressive early deflation
The rational QZ method generalizes the QZ method by implicitly supporting
rational subspace iteration. In this paper we extend the rational QZ method by
introducing shifts and poles of higher multiplicity in the Hessenberg pencil,
which is a pencil consisting of two Hessenberg matrices. The result is a
multishift, multipole iteration on block Hessenberg pencils which allows one to
stick to real arithmetic for a real input pencil. In combination with optimally
packed shifts and aggressive early deflation as an advanced deflation technique
we obtain an efficient method for the dense generalized eigenvalue problem. In
the numerical experiments we compare the results with state-of-the-art routines
for the generalized eigenvalue problem and show that we are competitive in
terms of speed and accuracy
On pole-swapping algorithms for the eigenvalue problem
Pole-swapping algorithms, which are generalizations of the QZ algorithm for
the generalized eigenvalue problem, are studied. A new modular (and therefore
more flexible) convergence theory that applies to all pole-swapping algorithms
is developed. A key component of all such algorithms is a procedure that swaps
two adjacent eigenvalues in a triangular pencil. An improved swapping routine
is developed, and its superiority over existing methods is demonstrated by a
backward error analysis and numerical tests. The modularity of the new
convergence theory and the generality of the pole-swapping approach shed new
light on bi-directional chasing algorithms, optimally packed shifts, and bulge
pencils, and allow the design of novel algorithms
A rational QZ method
We propose a rational QZ method for the solution of the dense, unsymmetric
generalized eigenvalue problem. This generalization of the classical QZ method
operates implicitly on a Hessenberg, Hessenberg pencil instead of on a
Hessenberg, triangular pencil. Whereas the QZ method performs nested subspace
iteration driven by a polynomial, the rational QZ method allows for nested
subspace iteration driven by a rational function, this creates the additional
freedom of selecting poles. In this article we study Hessenberg, Hessenberg
pencils, link them to rational Krylov subspaces, propose a direct reduction
method to such a pencil, and introduce the implicit rational QZ step. The link
with rational Krylov subspaces allows us to prove essential uniqueness
(implicit Q theorem) of the rational QZ iterates as well as convergence of the
proposed method. In the proofs, we operate directly on the pencil instead of
rephrasing it all in terms of a single matrix. Numerical experiments are
included to illustrate competitiveness in terms of speed and accuracy with the
classical approach. Two other types of experiments exemplify new possibilities.
First we illustrate that good pole selection can be used to deflate the
original problem during the reduction phase, and second we use the rational QZ
method to implicitly filter a rational Krylov subspace in an iterative method
Block Tridiagonal Reduction of Perturbed Normal and Rank Structured Matrices
It is well known that if a matrix solves the
matrix equation , where is a linear bivariate polynomial,
then is normal; and can be simultaneously reduced in a finite
number of operations to tridiagonal form by a unitary congruence and, moreover,
the spectrum of is located on a straight line in the complex plane. In this
paper we present some generalizations of these properties for almost normal
matrices which satisfy certain quadratic matrix equations arising in the study
of structured eigenvalue problems for perturbed Hermitian and unitary matrices.Comment: 13 pages, 3 figure
A Householder-based algorithm for Hessenberg-triangular reduction
The QZ algorithm for computing eigenvalues and eigenvectors of a matrix
pencil requires that the matrices first be reduced to
Hessenberg-triangular (HT) form. The current method of choice for HT reduction
relies entirely on Givens rotations regrouped and accumulated into small dense
matrices which are subsequently applied using matrix multiplication routines. A
non-vanishing fraction of the total flop count must nevertheless still be
performed as sequences of overlapping Givens rotations alternately applied from
the left and from the right. The many data dependencies associated with this
computational pattern leads to inefficient use of the processor and poor
scalability.
In this paper, we therefore introduce a fundamentally different approach that
relies entirely on (large) Householder reflectors partially accumulated into
block reflectors, by using (compact) WY representations. Even though the new
algorithm requires more floating point operations than the state of the art
algorithm, extensive experiments on both real and synthetic data indicate that
it is still competitive, even in a sequential setting. The new algorithm is
conjectured to have better parallel scalability, an idea which is partially
supported by early small-scale experiments using multi-threaded BLAS. The
design and evaluation of a parallel formulation is future work