Search CORE

7 research outputs found

Boosting the performance of remote GPU virtualization using InfiniBand Connect-IB and PCIe 3.0

Author: Castelló Gimeno Adrián
Duato Marín José Francisco
Peña Monferrer Antonio José
Quintana Ortí Enrique Salvador
Reaño González Carlos
Schultz Scot
Shainer Gilad
Silla Jiménez Federico
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 22/09/2014

© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

A clear trend has emerged involving the acceleration of scientific applications by using GPUs. However, the capabilities of these devices are still generally underutilized. Remote GPU virtualization techniques can help increase GPU utilization rates, while reducing acquisition and maintenance costs. The overhead of using a remote GPU instead of a local one is introduced mainly by the difference in performance between the internode network and the intranode PCIe link. In this paper we show how using the new InfiniBand Connect-IB network adapters (attaining similar throughput to that of the most recently emerged GPUs) boosts the performance of remote GPU virtualization, reducing the overhead to a mere 0.19% in the application tested.

This work was funded by the Generalitat Valenciana under Grant PROMETEOII/2013/009 of the PROMETEO program phase II. This material is based upon work supported by the U.S. Department of Energy, Office of Science, Advanced Scientific Computing Research (SC-21), under Contract No. DE-AC02-06CH11357. Authors from the Universitat Politècnica de València and Universitat Jaume I are grateful for the generous support provided by Mellanox Technologies.

Reaño González, C.; Silla Jiménez, F.; Peña Monferrer, AJ.; Shainer, G.; Schultz, S.; Castelló Gimeno, A.; Quintana Orti, ES.... (2014). Boosting the performance of remote GPU virtualization using InfiniBand Connect-IB and PCIe 3.0. In 2014 IEEE International Conference on Cluster Computing (CLUSTER). IEEE. 266-267. doi:10.1109/CLUSTER.2014.6968737

Crossref

RiuNet

The ParaPhrase project : parallel patterns for adaptive heterogeneous multicore systems

Author: Aldinucci Marco
Brown Christopher Mark
Cesarini Francesco
Danelutto Marco
González-Vélez Horacio
Hammond Kevin
Keller Rainer
Kilpatrick Peter
Rossbory Michael
Shainer Gilad
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/08/2019

Funding: This work has been supported by the European Union Framework 7 grant IST-2011-288570 “ParaPhrase: Parallel Patterns for Adaptive Heterogeneous Multicore Systems”.

This paper describes the ParaPhrase project, a new 3-year targeted research project funded under EU Framework 7 Objective 3.4 (Computer Systems), starting in October 2011. ParaPhrase aims to follow a new approach to introducing parallelism using advanced refactoring techniques coupled with high-level parallel design patterns. The refactoring approach will use these design patterns to restructure programs defined as networks of software components into other forms that are more suited to parallel execution. The programmer will be aided by high-level cost information that will be integrated into the refactoring tools. The implementation of these patterns will then use a well-understood algorithmic skeleton approach to achieve good parallelism. A key ParaPhrase design goal is that parallel components are intended to match heterogeneous architectures, defined in terms of CPU/GPU combinations, for example. In order to achieve this, the ParaPhrase approach will map components at link time to the available hardware, and will then re-map them during program execution, taking account of multiple applications, changes in hardware resource availability, the desire to reduce communication costs, etc. In this way, we aim to develop a new approach to programming that will be able to produce software that can adapt to dynamic changes in the system environment. Moreover, by using a strong component basis for parallelism, we can achieve potentially significant gains in terms of reducing sharing at a high level of abstraction, and so in reducing or even eliminating the costs that are usually associated with cache management, locking, and synchronisation.

Postprint

St Andrews Research Repository

Architecture and Implementation of Sockets Direct Protocol in Windows

Author: Dror Goldenberg
Gilad Shainer
Tzachi Dar

Sockets Direct Protocol (SDP) is a byte stream protocol that utilizes the capabilities of the InfiniBand fabric to transparently achieve performance gains for existing socket-based networked applications. We implemented an SDP stack for the Windows operating system that is fully interoperable with Linux. The paper describes early experience with the implementation of the protocol stack. We go through the motivation, the main architectural aspects of the implementation, and its challenges. We present preliminary performance results. Running over 20 Gb/s InfiniBand double data rate (DDR) links we observed bandwidth record o

CiteSeerX

Local and Remote GPUs Perform Similar with EDR 100G InfiniBand

Author: Reaño Carlos
Schultz Scot
Shainer Gilad
Silla Federico
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/12/2015

Queen's University Belfast Research Portal

Boosting the Performance of Remote GPU Virtualization Using InfiniBand Connect-IB and PCIe 3.0

Author: Castello Adrian
Duato J.
Peña Antonio
Quintana Orti Enrique
Reaño Carlos
Schultz Scot
Shainer Gilad
Silla Federico
Publication date: 01/01/2014

Queen's University Belfast Research Portal

Crossref

RiuNet