133 research outputs found

    John von Neumann-Institut für Computing

    No full text
    The Basic Linear Algebra Subprograms, BLAS, are the basic computational kernels in most applications. BLAS 1 and BLAS 2, the vector-vector and matrix-vector routines, require memory accesses in the same order as computations and thus cannot achieve performance close to peak performance on modern computer architectures. BLAS 3 matrix-matrix operations on n × n-matrices on the other side can do order n 3 operations with only order n 2 memory accesses. This much better ratio of computation to memory access allows for much higher performance. To show which performance can be expected using the BLAS routines from IBM’s ESSL on an IBM p690 we investigated the performance of one routine of each BLAS level and compared The Basic Linear Algebra Subprograms, BLAS ([1], [2], [3]), have been the basic building blocks for application programs since the 1980th with the upcoming vector machines and even more since the 1990th with the BLAS 3 being optimall
    • …
    corecore