Accurate and Efficient Expression Evaluation and Linear Algebra
We survey and unify recent results on the existence of accurate algorithms
for evaluating multivariate polynomials, and more generally for accurate
numerical linear algebra with structured matrices. By "accurate" we mean that
the computed answer has relative error less than 1, i.e., has some correct
leading digits. We also address efficiency, by which we mean algorithms that
run in polynomial time in the size of the input. Our results will depend
strongly on the model of arithmetic: Most of our results will use the so-called
Traditional Model (TM). We give a set of necessary and sufficient conditions to
decide whether a high accuracy algorithm exists in the TM, and describe
progress toward a decision procedure that will take any problem and provide
either a high accuracy algorithm or a proof that none exists. When no accurate
algorithm exists in the TM, it is natural to extend the set of available
accurate operations by a library of additional operations, such as dot
products, or indeed any enumerable set, which could then be used to build
further accurate algorithms. We show how our accurate algorithms and decision
procedure for finding them extend to this case. Finally, we address other
models of arithmetic, and the relationship between (im)possibility in the TM
and (in)efficient algorithms operating on numbers represented as bit strings. Comment: 49 pages, 6 figures, 1 table
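The abstract names dot products as a natural library operation for extending the Traditional Model. As a concrete illustration (not code from the paper), a compensated dot product in the style of Ogita, Rump, and Oishi uses error-free transformations to recover correct leading digits where the naive loop loses all of them:

```python
def two_sum(a, b):
    # Error-free transformation (Knuth): a + b = s + e exactly in floating point.
    s = a + b
    bb = s - a
    e = (a - (s - bb)) + (b - bb)
    return s, e

def split(a):
    # Dekker's splitting: a = hi + lo, each half fitting in 26 significand bits.
    c = 134217729.0 * a  # 134217729 = 2**27 + 1
    hi = c - (c - a)
    return hi, a - hi

def two_prod(a, b):
    # Error-free transformation (Dekker): a * b = p + e exactly.
    p = a * b
    ah, al = split(a)
    bh, bl = split(b)
    e = ((ah * bh - p) + ah * bl + al * bh) + al * bl
    return p, e

def dot2(x, y):
    # Compensated dot product: accumulates the rounding errors of every
    # product and partial sum, giving a result about as accurate as if
    # computed in doubled working precision and rounded once at the end.
    s = c = 0.0
    for a, b in zip(x, y):
        p, pe = two_prod(a, b)
        s, se = two_sum(s, p)
        c += pe + se
    return s + c
```

For the ill-conditioned data `x = [1e16, 1.0, -1e16]`, `y = [1, 1, 1]`, the naive loop returns 0.0 (relative error 1, no correct digits) while `dot2` returns the exact answer 1.0.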
Accurate solution of structured least squares problems via rank-revealing decompositions
Least squares problems $\min_x \|b - Ax\|_2$, where the matrix $A \in \mathbb{C}^{m \times n}$ ($m \ge n$) has some particular structure, arise frequently in applications. Polynomial data fitting is a well-known instance of problems that yield highly structured matrices, but many other examples exist. Very often, structured matrices have huge condition numbers $\kappa_2(A) = \|A\|_2 \|A^\dagger\|_2$ (where $A^\dagger$ is the Moore-Penrose pseudoinverse of $A$), and therefore standard algorithms fail to compute accurate minimum 2-norm solutions of least squares problems. In this work, we introduce a framework that allows us to compute minimum 2-norm solutions of many classes of structured least squares problems accurately, i.e., with errors $\|\hat{x}_0 - x_0\|_2 / \|x_0\|_2 = O(u)$, where $u$ is the unit roundoff, independently of the magnitude of $\kappa_2(A)$ for most vectors $b$. The cost of these accurate computations is $O(n^2 m)$ flops, i.e., roughly the same cost as standard algorithms for least squares problems. The approach in this work relies on first computing an accurate rank-revealing decomposition of $A$, an idea that has been widely used in recent decades to compute, for structured ill-conditioned matrices, singular value decompositions, eigenvalues and eigenvectors in the Hermitian case, and solutions of linear systems with high relative accuracy. In order to prove that accurate solutions are computed, a new multiplicative perturbation theory of the least squares problem is needed. The results presented in this paper are valid for both full rank and rank deficient problems, and also in the case of underdetermined linear systems ($m < n$).
Among other types of matrices, the new method applies to rectangular Cauchy, Vandermonde, and graded matrices, and detailed numerical tests for Cauchy matrices are presented. This work was supported by the Ministerio de Economía y Competitividad of Spain through grants MTM-2009-09281, MTM-2012-32542 (Ceballos, Dopico, and Molera) and MTM2010-18057 (Castro-González).
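Polynomial data fitting is the abstract's motivating instance of structure. A few lines of NumPy (an illustrative sketch with hypothetical parameters, not the authors' code) show how quickly $\kappa_2(A)$ blows up for Vandermonde matrices, which is exactly why standard least squares solvers lose accuracy on them:

```python
import numpy as np

def vandermonde_cond(m, n):
    # Least squares matrix for fitting a degree-(n-1) polynomial to m
    # equispaced samples on [0, 1]: A[i, j] = t_i**j, shape (m, n).
    t = np.linspace(0.0, 1.0, m)
    A = np.vander(t, n, increasing=True)
    # 2-norm condition number kappa_2(A) = ||A||_2 * ||A^+||_2.
    return np.linalg.cond(A, 2)

for n in (5, 10, 15, 20):
    print(n, vandermonde_cond(40, n))  # grows rapidly with the degree
```

Even for modest degrees the condition number exceeds $10^8$, so a backward-stable solver's error bound, proportional to $u\,\kappa_2(A)$, permits the computed solution to have no correct digits; the rank-revealing approach sidesteps this dependence.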
The Power of Bidiagonal Matrices
Bidiagonal matrices are widespread in numerical linear algebra, not least
because of their use in the standard algorithm for computing the singular value
decomposition and their appearance as LU factors of tridiagonal matrices. We
show that bidiagonal matrices have a number of interesting properties that make
them powerful tools in a variety of problems, especially when they are
multiplied together. We show that the inverse of a product of bidiagonal
matrices is insensitive to small componentwise relative perturbations in the
factors if the factors or their inverses are nonnegative. We derive
componentwise rounding error bounds for the solution of a linear system $Ax = b$, where $A$ or $A^{-1}$ is a product of bidiagonal
matrices, showing that strong results are obtained when the factors are
nonnegative or have a checkerboard sign pattern. We show that given the factorization
of an $n \times n$ totally nonnegative matrix $A$ into the product of bidiagonal
matrices, $A^{-1}$ can be computed in $O(n^2)$ flops and that in
floating-point arithmetic the computed result has small relative error, no
matter how large $\kappa(A)$ is. We also show how factorizations
involving bidiagonal matrices of some special matrices, such as the Frank
matrix and the Kac--Murdock--Szeg\"o matrix, yield simple proofs of the total
nonnegativity and other properties of these matrices.
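One reason products of bidiagonal factors are so convenient (an illustrative sketch, not code from the paper): a single bidiagonal system is solved by substitution in $O(n)$ flops, so a system whose matrix is a product of $k$ bidiagonal factors costs only $k$ such sweeps.

```python
import numpy as np

def solve_lower_bidiagonal(d, e, b):
    """Solve L x = b by forward substitution in O(n) flops, where L is
    lower bidiagonal with main diagonal d (length n) and subdiagonal e
    (length n-1)."""
    n = len(d)
    x = np.empty(n)
    x[0] = b[0] / d[0]
    for i in range(1, n):
        # Only one previously computed entry feeds into row i.
        x[i] = (b[i] - e[i - 1] * x[i - 1]) / d[i]
    return x
```

To solve $(B_1 B_2 \cdots B_k)x = b$ one would apply such a sweep factor by factor, solving $B_1 y_1 = b$, then $B_2 y_2 = y_1$, and so on; when the factors are nonnegative, each sweep involves no subtractive cancellation, which is the mechanism behind the componentwise error bounds described above.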
Toward accurate polynomial evaluation in rounded arithmetic
Given a multivariate real (or complex) polynomial $p$ and a domain ${\cal D}$,
we would like to decide whether an algorithm exists to evaluate $p(x)$
accurately for all $x \in {\cal D}$ using rounded real (or complex) arithmetic.
Here ``accurately'' means with relative error less than 1, i.e., with some
correct leading digits. The answer depends on the model of rounded arithmetic:
We assume that for any arithmetic operation $\mathrm{op}(a,b)$, for example $a + b$ or $a \cdot b$, its computed value is $\mathrm{op}(a,b)(1+\delta)$, where $|\delta|$ is bounded by some constant $\epsilon$ with $0 < \epsilon < 1$, but
$\delta$ is otherwise arbitrary. This model is the traditional one used to
analyze the accuracy of floating point algorithms. Our ultimate goal is to
establish a decision procedure that, for any $p$ and ${\cal D}$, either exhibits
an accurate algorithm or proves that none exists. In contrast to the case where
numbers are stored and manipulated as finite bit strings (e.g., as floating
point numbers or rational numbers) we show that some polynomials are
impossible to evaluate accurately. The existence of an accurate algorithm will
depend not just on $p$ and ${\cal D}$, but on which arithmetic operations and
which constants are available and whether branching is permitted. Toward
this goal, we present necessary conditions on $p$ for it to be accurately
evaluable on open real or complex domains ${\cal D}$. We also give sufficient
conditions, and describe progress toward a complete decision procedure. We do
present a complete decision procedure for homogeneous polynomials $p$ with
integer coefficients, ${\cal D} = \C^n$, and using only the arithmetic
operations $+$, $-$, and $\cdot$. Comment: 54 pages, 6 figures; refereed version; to appear in Foundations of
Computational Mathematics: Santander 2005, Cambridge University Press, March
2006
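The rounding model described above can be simulated directly: wrap every operation so it returns the exact value times $(1+\delta)$ with $|\delta| \le \epsilon$. The sketch below (illustrative only, with a deliberately coarse $\epsilon = 10^{-3}$ so the effect is visible) shows why evaluating $x^2 - y^2$ term by term loses all leading digits near the variety $x = \pm y$, while the factored form $(x+y)(x-y)$ keeps its relative error bounded by roughly $3\epsilon$:

```python
import random

EPS = 1e-3  # coarse "machine epsilon" for the simulated model

def rnd(v):
    # Traditional Model: each operation returns the exact result times
    # (1 + delta), |delta| <= EPS, with delta otherwise arbitrary (random here).
    return v * (1.0 + random.uniform(-EPS, EPS))

def p_direct(x, y):
    # x*x - y*y: three rounded operations; cancellation amplifies the deltas.
    return rnd(rnd(x * x) - rnd(y * y))

def p_factored(x, y):
    # (x + y)*(x - y): each factor has relative error <= EPS, so the
    # product's relative error is bounded regardless of cancellation.
    return rnd(rnd(x + y) * rnd(x - y))

random.seed(0)
x, y = 1.0, 1.0 + 1e-7           # nearly equal, exact value about -2e-7
exact = x * x - y * y
err_direct = max(abs(p_direct(x, y) - exact) / abs(exact) for _ in range(100))
err_factored = max(abs(p_factored(x, y) - exact) / abs(exact) for _ in range(100))
print(err_direct, err_factored)
```

In runs of this simulation the direct form's worst relative error exceeds 1 (no correct digits), while the factored form stays near $3\epsilon$; which rewritings exist, and for which polynomials, is exactly what the decision procedure must determine.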
Vandermonde Neural Operators
Fourier Neural Operators (FNOs) have emerged as very popular machine learning
architectures for learning operators, particularly those arising in PDEs.
However, as FNOs rely on the fast Fourier transform for computational
efficiency, the architecture can be limited to input data on equispaced
Cartesian grids. Here, we generalize FNOs to handle input data on
non-equispaced point distributions. Our proposed model, termed the Vandermonde
Neural Operator (VNO), utilizes Vandermonde-structured matrices to efficiently
compute forward and inverse Fourier transforms, even on arbitrarily distributed
points. We present numerical experiments to demonstrate that VNOs can be
significantly faster than FNOs, while retaining comparable accuracy, and
improve upon the accuracy of comparable non-equispaced methods such as the Geo-FNO. Comment: 21 pages, 10 figures
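The structural idea can be seen in a few lines (an illustrative sketch, not the authors' implementation): the discrete Fourier transform is multiplication by a Vandermonde matrix whose nodes lie on the unit circle, and the same construction accepts arbitrary, non-equispaced points.

```python
import numpy as np

def nonuniform_dft(f, x, K):
    """Fourier transform of samples f_j taken at arbitrary points
    x_j in [0, 1), via the Vandermonde matrix V[k, j] = z_j**k with
    nodes z_j = exp(-2*pi*i*x_j):  c_k = sum_j f_j * z_j**k.
    When x_j = j/n is equispaced and K = n, this is exactly the
    standard DFT computed by np.fft.fft."""
    z = np.exp(-2j * np.pi * np.asarray(x))
    V = z[np.newaxis, :] ** np.arange(K)[:, np.newaxis]  # (K, len(x))
    return V @ np.asarray(f)
```

The dense matrix product here costs $O(Kn)$ and is meant only to expose the structure; the efficiency claims in the abstract refer to exploiting the Vandermonde structure rather than forming this product naively.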