Search CORE

33 research outputs found

An Error Correction Solver for Linear Systems: Evaluation of Mixed Precision Implementations

Author: Anzt Hartwig
Heuveline Vincent
Rocker Björn
Publication venue: Karlsruher Institut für Technologie
Publication date: 01/01/2010
Field of study

KITopen

Mixed Precision Error Correction Methods for Linear Systems: Convergence Analysis based on Krylov Subspace Methods

Author: Anzt Hartwig
Heuveline Vincent
Rocker Björn
Publication venue: Karlsruher Institut für Technologie
Publication date: 01/01/2010
Field of study

KITopen

Energy Efficiency of Mixed Precision Iterative Refinement Methods using Hybrid Hardware Platforms: An Evaluation of different Solver and Hardware Configurations

Author: Anzt Hartwig
Heuveline Vincent
Rocker Björn
Publication venue: Karlsruher Institut für Technologie
Publication date: 01/01/2010
Field of study

KITopen

GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement

Author: Anzt H.
Dongarra J.
Heuveline Vincent
Luszczek P.
Publication venue: Karlsruher Institut für Technologie
Publication date: 01/01/2011
Field of study

In hardware-aware high performance computing, block-asynchronous iteration and mixed precision iterative refinement are two techniques that may be used to leverage the computing power of SIMD accelerators like GPUs in the iterative solution of linear equation systems. although they use a very different approach for this purpose, they share the basic idea of compensating the convergence properties of an inferior numerical algorithm by a more efficient usage of the provided computing power. In this paper, we analyze the potential of combining both techniques. Therefore, we derive a mixed precision iterative refinement algorithm using a block-asynchronous iteration as an error correction solver, and compare its performance with a pure implementation of a block-asynchronous iteration and an iterative refinement method using double precision for the error correction solver. For matrices from the University of Florida Matrix collection, we report the convergence behaviour and provide the total solver runtime using different GPU architectures

KITopen

GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement

Author: A. Buttari
A. Frommer
D. Chazan
D. Göddeke
H. Anzt
M. Baboulin
U. Aydin
Z.-Z. Bai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Crossref

KITopen

A modular precision format for decoupling arithmetic format and storage format

Author: A Buttari
D Göddeke
E Carson
E Carson
H Anzt
Hartwig Anzt
KE Prikopa
NJ Higham
Publication venue: Springer
Publication date: 01/01/2019
Field of study

Crossref

KITopen

Physics in Design:Real-time Numerical Simulation Integrated into the CAD Environment

Author: Wits Wessel W.
Zwier Marijn P.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

As today's markets are more susceptible to rapid changes and involve global players, a short time to market is required to keep a competitive edge. Concurrently, products are integrating an increasing number of functions and technologies, thus becoming progressively complex. Therefore, efficient and effective product development is essential. For early design phases, in which a large portion of the product cost is determined, it is important that different concepts can be developed and evaluated quickly. An established way of evaluating a design is using numerical methods, such as Finite Element Analysis (FEA). However, setting up numerical simulations in early design phases when concepts change repeatedly is time consuming. This is largely due to the fact that for each design change concepts need to be re-meshed, boundary conditions re-applied and solutions re-calculated. In this paper, a framework is proposed that establishes a real-time connection between the CAD environment and FEA software. Simulation results are automatically updated when the CAD model is updated. Partial re-meshing and smart boundary condition re-application techniques allow for a real-time assessment of design changes. The developed framework is especially interesting for the assessment of multi-physics phenomena in early design phases, as multiple fields can be interpreted by a design engineer that is usually specialized in a specific field

University of Twente Research Information

Optimization of Power Consumption in the Iterative Solution of Sparse Linear Systems on Graphics Processors

Author: Anzt Hartwig
Castillo Maribel
Fernandez Juan C.
Heuveline Vincent
Igual Francisco D.
Mayo Rafael
Quintana-Orti Enrique S.
Publication venue: Karlsruher Institut für Technologie
Publication date: 01/01/2011
Field of study

KITopen

Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems

Author: Anzt Hartwig
Castillo Maribel
Fernández Juan C.
Heuveline Vincent
Mayo Rafael
Quintana-Orti Enrique S.
Rocker Björn
Publication venue: Karlsruher Institut für Technologie
Publication date: 01/01/2011
Field of study

KITopen

Fast recursive filters for simulating nonlinear dynamic systems

Author: van Hateren J. H.
Publication venue
Publication date: 30/08/2007
Field of study

A fast and accurate computational scheme for simulating nonlinear dynamic systems is presented. The scheme assumes that the system can be represented by a combination of components of only two different types: first-order low-pass filters and static nonlinearities. The parameters of these filters and nonlinearities may depend on system variables, and the topology of the system may be complex, including feedback. Several examples taken from neuroscience are given: phototransduction, photopigment bleaching, and spike generation according to the Hodgkin-Huxley equations. The scheme uses two slightly different forms of autoregressive filters, with an implicit delay of zero for feedforward control and an implicit delay of half a sample distance for feedback control. On a fairly complex model of the macaque retinal horizontal cell it computes, for a given level of accuracy, 1-2 orders of magnitude faster than 4th-order Runge-Kutta. The computational scheme has minimal memory requirements, and is also suited for computation on a stream processor, such as a GPU (Graphical Processing Unit).Comment: 20 pages, 8 figures, 1 table. A comparison with 4th-order Runge-Kutta integration shows that the new algorithm is 1-2 orders of magnitude faster. The paper is in press now at Neural Computatio

arXiv.org e-Print Archive

CiteSeerX

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

University of Groningen Digital Archive

Dissertations of the University of Groningen