Search CORE

2 research outputs found

Analytical Modeling of Optimized Sparse Linear Code

Author: D.B. Heras
M. Wolfe
S. Rollin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004
Field of study

Analytical Modeling of Optimized Sparse Linear Code

Author: Ivan ˇ Simeček
Pavel Tvrdík
Publication venue
Publication date
Field of study

Abstract In this paper, we describe source code transformations based on sw-pipelining, loop unrolling, and loop fusion for the sparse matrix-vector multiplication and for the Conjugate Gradient algorithm that enable data prefetching and overlapping of load and FPU arithmetic instructions and improve the temporal cache locality. We develop a probabilistic model for estimation of the numbers of cache misses for 3 types of data caches: direct mapped and s-way set associative with random and with LRU replacement strategies. Using HW cache monitoring tools, we compare the predicted number of cache misses with real numbers on Intel x86 architecture with L1 and L2 caches. The accuracy of our analytical model is around 97%. The errors in estimations are due to minor simplifying assumptions in our model.

CiteSeerX