12 research outputs found

    Multishift variants of the QZ algorithm with aggressive early deflation

    Get PDF
    New variants of the QZ algorithm for solving the generalized eigenvalue problem are proposed. An extension of the small-bulge multishift QR algorithm is developed, which chases chains of many small bulges instead of only one bulge in each QZ iteration. This allows the effective use of level 3 BLAS operations, which in turn can provide efficient utilization of high performance computing systems with deep memory hierarchies. Moreover, an extension of the aggressive early deflation strategy is proposed, which can identify and de. ate converged eigenvalues long before classic deflation strategies would. Consequently, the number of overall QZ iterations needed until convergence is considerably reduced. As a third ingredient, we reconsider the deflation of infinite eigenvalues and present a new deflation algorithm, which is particularly effective in the presence of a large number of infinite eigenvalues. Combining all these developments, our implementation significantly improves existing implementations of the QZ algorithm. This is demonstrated by numerical experiments with random matrix pairs as well as with matrix pairs arising from various applications

    A new deflation criterion for the QZ algorithm

    Full text link
    The QZ algorithm computes the Schur form of a matrix pencil. It is an iterative algorithm and at some point, it must decide that an eigenvalue has converged and move on with another one. Choosing a criterion that makes this decision is nontrivial. If it is too strict, the algorithm might waste iterations on already converged eigenvalues. If it is not strict enough, the computed eigenvalues might be inaccurate. Additionally, the criterion should not be computationally expensive to evaluate. This paper introduces a new criterion based on the size of and the gap between the eigenvalues. This is similar to the work of Ahues and Tissuer for the QR algorithm. Theoretical arguments and numerical experiments suggest that it outperforms the most popular criteria in terms of accuracy. Additionally, this paper evaluates some commonly used criteria for infinite eigenvalues.Comment: 10 pages, 6 figure

    A multishift, multipole rational QZ method with aggressive early deflation

    Full text link
    The rational QZ method generalizes the QZ method by implicitly supporting rational subspace iteration. In this paper we extend the rational QZ method by introducing shifts and poles of higher multiplicity in the Hessenberg pencil, which is a pencil consisting of two Hessenberg matrices. The result is a multishift, multipole iteration on block Hessenberg pencils which allows one to stick to real arithmetic for a real input pencil. In combination with optimally packed shifts and aggressive early deflation as an advanced deflation technique we obtain an efficient method for the dense generalized eigenvalue problem. In the numerical experiments we compare the results with state-of-the-art routines for the generalized eigenvalue problem and show that we are competitive in terms of speed and accuracy

    A Novel Parallel QR Algorithm For Hybrid Distributed Memory HPC Systems

    Get PDF
    A novel variant of the parallel QR algorithm for solving dense nonsymmetric eigenvalue problems on hybrid distributed high performance computing systems is presented. For this purpose, we introduce the concept of multiwindow bulge chain chasing and parallelize aggressive early deflation. The multiwindow approach ensures that most computations when chasing chains of bulges are performed in level 3 BLAS operations, while the aim of aggressive early deflation is to speed up the convergence of the QR algorithm. Mixed MPI-OpenMP coding techniques are utilized for porting the codes to distributed memory platforms with multithreaded nodes, such as multicore processors. Numerous numerical experiments confirm the superior performance of our parallel QR algorithm in comparison with the existing ScaLAPACK code, leading to an implementation that is one to two orders of magnitude faster for sufficiently large problems, including a number of examples from applications

    A parallel Schur method for solving continuous-time algebraic Riccati equations

    Get PDF
    Numerical algorithms for solving the continuous-time algebraic Riccati matrix equation on a distributed memory parallel computer are considered. In particular, it is shown that the Schur method, based on computing the stable invariant subspace of a Hamiltonian matrix, can be parallelized in an efficient and scalable way. Our implementation employs the state-of-the-art library ScaLAPACK as well as recently developed parallel methods for reordering the eigenvalues in a real Schur form. Some experimental results are presented, confirming the scalability of our implementation and comparing it with an existing implementation of the matrix sign iteration from the PLiCOC library

    Parallel eigenvalue reordering in real Schur forms

    Get PDF
    A parallel algorithm for reordering the eigenvalues in the real Schur form of a matrix is presented and discussed. Our novel approach adopts computational windows and delays multiple outside-window updates until each window has been completely reordered locally. By using multiple concurrent windows the parallel algorithm has a high level of concurrency, and most work is level 3 BLAS operations. The presented algorithm is also extended to the generalized real Schur form. Experimental results for ScaLAPACK-style Fortran 77 implementations on a Linux cluster confirm the efficiency and scalability of our algorithms in terms of more than 16 times of parallel speedup using 64 processors for large-scale problems. Even on a single processor our implementation is demonstrated to perform significantly better compared with the state-of-the-art serial implementation. Copyright (C) 2009 John Wiley & Sons, Ltd

    Algoritmos paralelos para la reducci贸n de sistemas lineales de control estables

    Full text link
    [EN] In the field of control theory, sometimes system models of big size (with many state variables) appear. When one of these high order systems needs to be simulated, studied or controlled, it is convenient to perform a previous work of model reduction in order to reduce the necessary (economic and temporal) costs. This process of obtaining a low order adequate representation of the original system usually has a high cost, because it has to be done with the original unreduced system. Thus, it is important to have high performance implementations for the problem of reducing linear control systems. In this thesis high performance implementations for some methods of model reduction have been developed. Current algorithms for model reduction of stable linear control systems and their implementation in the control library SLICOT have been analysed. New parallel algorithms for the methods strongly based on solving Lyapunov equations have been proposed. The new developed routines are incorporated in the high performance library for control PSLICOT. Apart from the main functions in charge of model reduction, all operations appearing in the problem and not having a high performance version yet have also been parallelised. One of these operations is the solution of Lyapunov equations in standard form. Parallel routines for solving these equations have been developed. These routines solve the equation obtaining directly the Cholesky factor of the solution, which fits better their application in model reduction. For this, Hammarling's method has been parallelised. The new routines solve in parallel and for dense matrices the four possible variants of standard Lyapunov equations: discrete and continuous versions, both transposed and not transposed. Interfaces offered by all the parallelised routines are similar to that of the existing routines in ScaLAPACK library, so they are easy to use from a user of this kind of libraries. The new routines work with the same data distribution used in this library: 2D block cyclic distribution, which allows many other distributions. Thanks to the developed work, now there are available high performance parallel routines to reduce linear control systems by using different variants of the Square-Root Balance & Truncate model reduction method: with or without balancing and with or without using the singular perturbation approximation formulas. They are parallel implementations of the same algorithms and methods used in the sequential routines of the SLICOT library. This allows to efficiently reduce models of linear control systems of big size. Moreover, existing software in ScaLAPACK for the eigenvalue problem has been improved by covering cases not treated there: the solution of the generalised problem (by transforming it into standard form, which is not always possible) and the computation of the eigenvectors. This part of the work has been applied to a real problem of simulation of oceanic flows. Here, it is necessary to compute all the eigenvalues and eigenvectors of a generalised eigenvalue problem with a very big dimension. As a consequence, enormous eigenvalue problems have been solved (with matrices of order greater than 400000), that could not be solved previously. Solving them allows to improve the precision in the studies of the behaviour of oceanic flows.[ES] En el campo de la teor铆a de control en ocasiones aparecen modelos de sistemas con un tama帽o elevado (muchas variables de estado). Cuando se pretende simular, estudiar o controlar uno de estos sistemas de orden elevado, conviene realizar un trabajo previo de reducci贸n del modelo del sistema con el prop贸sito de reducir los costes (econ贸micos/temporales) necesarios en un tratamiento posterior. El proceso de obtenci贸n de un sistema de orden reducido que represente adecuadamente el sistema original suele ser costoso, ya que necesariamente se tiene que hacer con el sistema original sin reducir. Por esto, resulta conveniente disponer de implementaciones de altas prestaciones para el problema de reducci贸n de sistemas lineales de control. En esta tesis se han desarrollado implementaciones de altas prestaciones para algunos m茅todos de reducci贸n de modelos. Se han analizado los algoritmos existentes para la reducci贸n de modelos de sistemas lineales de control estables y sus implementaciones en la librer铆a de control SLICOT. Se han propuesto nuevos algoritmos paralelos para los m茅todos cuyo n煤cleo principal es la resoluci贸n de ecuaciones de Lyapunov. Las nuevas rutinas desarrolladas se incorporan a la librer铆a de computaci贸n de altas prestaciones para control PSLICOT. Aparte de las funciones principales a cargo de la reducci贸n de modelos, se han tenido que paralelizar tambi茅n todas aquellas operaciones num茅ricas que aparecen en este problema y de las que no se dispon铆a de versiones de altas prestaciones. De estas operaciones, cabe destacar rutinas paralelas para la resoluci贸n de la ecuaci贸n de Lyapunov en su forma est谩ndar obteniendo directamente el factor de Cholesky de la soluci贸n, que es lo que se necesita para la reducci贸n de modelos. El m茅todo utilizado es una versi贸n paralela del m茅todo de Hammarling. Las rutinas implementadas resuelven en paralelo y para matrices densas las cuatro variantes posibles de la ecuaci贸n de Lyapunov: en su forma discreta y continua, traspuestas y sin trasponer. Todas las rutinas paralelizadas ofrecen una interfaz como la de las rutinas de la librer铆a ScaLAPACK, para que puedan ser usadas con facilidad por el usuario habituado a trabajar con este tipo de librer铆as. Se permiten las mismas distribuciones de datos que en esta librer铆a: una distribuci贸n c铆clica 2D por bloques, que engloba muchas otras distribuciones. Gracias al trabajo desarrollado, ahora se dispone de versiones paralelas de altas prestaciones para reducir sistemas lineales de control mediante diferentes variantes del m茅todo de balanceado y truncamiento de la ra铆z cuadrada (the Square-Root Balance & Truncate model reduction method): con o sin balanceado y con o sin usar las f贸rmulas de perturbaci贸n singular. Se trata de versiones paralelas de los mismos algoritmos y m茅todos que se utilizan en las versiones secuenciales de la librer铆a SLICOT. Esto permitir谩 reducir de forma eficiente modelos de sistemas lineales de control de gran tama帽o. Tambi茅n se ha mejorado la aplicabilidad del software existente en ScaLAPACK para el problema de valores propios cubriendo casos que no se contemplaban. Se ha trabajado en la soluci贸n del problema generalizado (mediante su transformaci贸n a forma est谩ndar, lo que no es aplicable en todos los casos) y tambi茅n en el c谩lculo de los vectores propios. Ambas operaciones se han utilizado en un problema real de simulaci贸n de flujos oce谩nicos. En esta aplicaci贸n se requiere el c谩lculo de todos los valores y vectores propios de un problema generalizado de gran dimensi贸n. Como consecuencia, ha sido posible resolver problemas de valores propios generalizados enormes (con matrices de m谩s de 400000 filas y columnas) que no hab铆an podido resolverse con anterioridad, permitiendo as铆 un estudio m谩s preciso del comportamiento de las corrientes oce谩nicas.[CA] En el camp de la teoria de control de vegades apareixen models de sistemes amb un tamany elevat (moltes variables d'estat). Quan es pret茅n simular, estudiar o controlar un d'aquests sistemes d'ordre elevat, conv茅 realitzar un treball previ de reducci贸 del model del sistema amb el prop貌sit de reduir els costos (econ貌mics/temporals) necesaris en un tractament posterior. El proc茅s d'obtenci贸 d'un sistema d'ordre reduit que represente adequadament el sistema original sol ser cost贸s, perque necess脿riament ha de fer-se amb el sistema original sense reduir. Per aquest motiu, resulta convenient disposar d'implementacions d'altes prestacions per al problema de reducci贸 de sistemes lineals de control. En aquesta tesis s'han desenvolupat implementacions d'altes prestacions per a alguns m猫todes de reducci贸 de models. S'han anal路litzat els algoritmes existents per a la reducci贸 de models de sistemes lineals de control estables i les seues implementacions en la llibreria de control SLICOT. S'han proposat nous algoritmes paral路lels per als m猫todes basats en la resoluci贸 d'equacions de Lyapunov. Les noves rutines desenvolupades s'incorporen a la llibreria de computaci贸 d'altes prestacions per a control PSLICOT. Apart de les funcions principals a c脿rrec de la reducci贸 de models, s'han hagut de paral路le\-lit\-zar tamb茅 totes aquelles operacions num猫riques que apareixen en aquest problema i per a les que no es disposava de versions d'altes prestacions. D'aquestes operacions, destaquen rutines paral路leles per a la resoluci贸 de l'equaci贸 de Lyapunov en la seua forma est脿ndard obtenint directament el factor de Cholesky de la soluci贸, que 茅s el que es necessita per a la reducci贸 de models. El m猫tode emprat 茅s una versi贸 paral路lela del m猫tode de Hammarling. Les rutines implementades resolen en paral路lel i per a matrius denses les quatre variants possibles de l'equaci贸 de Lyapunov: en la seua forma discreta i cont铆nua, traspostes i sense trasposar. Totes les rutines paral路lelitzades ofereixen una interfa莽 com la de les rutines de la llibreria ScaLAPACK, per a que puguen ser usades f脿cilment per l'usuari acostumat a treballar amb aquest tipus de llibreries. Es permeten les mateixes distribucions de dades que en aquesta llibreria: distribuci贸 c铆clica 2D per blocs, que engloba moltes altres distribucions. Gr脿cies al treball desenvolupat, ara es disposa de versions paral路leles d'altes prestacions per a reduir sistemes lineals de control mitjan莽ant diferents variants del m猫tode de balancejat i truncament de l'arrel quadrada (the Square-Root Balance & Truncate model reduction method): amb o sense balancejat i amb o sense usar les f贸rmules de perturbaci贸 singular. Son versions paral路leles dels mateixos algoritmes i m猫todes que s'utilitzen en les versions sequencials de la llibreria SLICOT. Aix貌 permetr脿 reduir de forma eficient models de sistemes lineals de control de gran tamany. Tamb茅 s'ha mitjorat l'aplicabilitat del software existent en ScaLAPACK per al problema de valors propis cobrint casos que no es contemplaven. S'ha treballat en la soluci贸 del problema generalitzat (mitjan莽ant la seua transformaci贸 a forma est脿ndard, cosa que no es pot fer sempre) i tamb茅 en el c脿lcul dels vectors propis. Ambdues operacions s'han utilitzat en un problema real de simulaci贸 de fluxos oce脿nics. En aquesta aplicaci贸 es requereix el c脿lcul de tots els valors i vectors propis d'un problema generalitzat de gran dimensi贸. Com a conseq眉猫ncia, ha sigut possible resoldre problemes de valors propis generalitzats molt grans (amb matrius de m茅s de 400000 files i columnes) que no s'havien pogut resoldre anteriorment, permetent aix铆 un estudi m茅s prec铆s del comportament de les corrents oce脿niques.Guerrero L贸pez, D. (2015). Algoritmos paralelos para la reducci贸n de sistemas lineales de control estables [Tesis doctoral no publicada]. Universitat Polit猫cnica de Val猫ncia. https://doi.org/10.4995/Thesis/10251/59415TESI
    corecore