Search CORE

29 research outputs found

Performance Evaluation of Sparse Matrix Multiplication Kernels on Intel Xeon Phi

Author: E-J Im
J Mellor-Crummey
M Krotkiewski
R Nishtala
Publication venue
Publication date: 05/02/2013
Field of study

Intel Xeon Phi is a recently released high-performance coprocessor which features 61 cores each supporting 4 hardware threads with 512-bit wide SIMD registers achieving a peak theoretical performance of 1Tflop/s in double precision. Many scientific applications involve operations on large sparse matrices such as linear solvers, eigensolver, and graph mining algorithms. The core of most of these applications involves the multiplication of a large, sparse matrix with a dense vector (SpMV). In this paper, we investigate the performance of the Xeon Phi coprocessor for SpMV. We first provide a comprehensive introduction to this new architecture and analyze its peak performance with a number of micro benchmarks. Although the design of a Xeon Phi core is not much different than those of the cores in modern processors, its large number of cores and hyperthreading capability allow many application to saturate the available memory bandwidth, which is not the case for many cutting-edge processors. Yet, our performance studies show that it is the memory latency not the bandwidth which creates a bottleneck for SpMV on this architecture. Finally, our experiments show that Xeon Phi's sparse kernel performance is very promising and even better than that of cutting-edge general purpose processors and GPUs

arXiv.org e-Print Archive

Crossref

Heterogeneous computing architecture for fast detection of SNP-SNP interactions

Author: Curk Tomaz
Sluga Davor
Uros Lotric
Zupan Blaz
Publication venue
Publication date: 01/01/2013
Field of study

The extent of data in a typical genome-wide association study (GWAS) poses considerable computational challenges to software tools for gene-gene interaction discovery. Exhaustive evaluation of all interactions among hundreds of thousands to millions of single nucleotide polymorphisms (SNPs) may require weeks or even months of computation. Massively parallel hardware within a modern Graphic Processing Unit (GPU) and Many Integrated Core (MIC) coprocessors can shorten the run time considerably. While the utility of GPU-based implementations in bioinformatics has been well studied, MIC architecture has been introduced only recently and may provide a number of comparative advantages that have yet to be explored and tested. We have developed a heterogeneous, GPU and Intel MIC-accelerated software module for SNP-SNP interaction discovery to replace the previously single-threaded computational core in the interactive web-based data exploration program SNPsyn. We report on differences between these two modern massively parallel architectures and their software environments. Their utility resulted in an order of magnitude shorter execution times when compared to the single-threaded CPU implementation. GPU implementation on a single Nvidia Tesla K20 runs twice as fast as that for the MIC architecture-based Xeon Phi P5110 coprocessor, but also requires considerably more programming effort. General purpose GPUs are a mature platform with large amounts of computing power capable of tackling inherently parallel problems, but can prove demanding for the programmer. On the other hand the new MIC architecture, albeit lacking in performance reduces the programming effort and makes it up with a more general architecture suitable for a wider range of problems

Benchmarking Performance of a Hybrid Intel Xeon/Xeon Phi System for Parallel Computation of Similarity Measures Between Large Vectors

Author
Publication venue: Springer
Publication date: 29/09/2016
Field of study

Springer - Publisher Connector

Explicit Fourth-Order Runge–Kutta Method on Intel Xeon Phi Coprocessor

Author: A Deslauriers
B Bylina
B Bylina
Beata Bylina
C Lawson
E Anderson
E Hairer
G Bianchi
IS Duff
J Bylina
J Bylina
J Bylina
J Jeffers
J Klamka
JC Butcher
JE Stone
JJ Dongarra
Joanna Potiopa
R Rahman
W Whitt
WH Press
WJ Stewart
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref