Search CORE

289 research outputs found

Recommended from our members

Efficient proton computed tomography image reconstruction using general purpose graphics processing units

Author: McAllister Scott Alan
Publication venue: CSUSB ScholarWorks
Publication date: 01/01/2009
Field of study

This thesis will investigate the use of general purpose graphics processing units (GPGPUs)for execution of parallel algorithms as well as exploiting the structure of the data being processed

CSUSB ScholarWorks

On the testing of special memories in GPGPUs

Author: Reorda Matteo Sonza
Rodriguez Condia Josie E.
Publication venue: IEEE
Publication date: 01/01/2020
Field of study

Nowadays, data-intensive processing applications, such as multimedia, high-performance computing and safety-critical ones (e.g., in automotive) employ General Purpose Graphics Processing Units (GPGPUs) due to their parallel processing capabilities and high performance. In these devices, multiple levels of memories are employed in GPGPUs to hide latency and increase the performance during the operation of a kernel. Moreover, modern GPGPU architectures implement cutting-edge semiconductor technologies, reducing their size and power consumption. However, some studies proved that these technologies are prone to faults during the operative life of a device, so compromising reliability. In this work, we developed functional test techniques based on parallel Software-Based Self-Test routines to test memory structures in the memory hierarchy of a GPGPU (FlexGripPlus) implementing the G80 architecture of Nvidia

Crossref

ZENODO

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Tackling Choke Point Induced Performance Bottlenecks in a Near-Threshold GPGPU

Author: Shabanian Tahmoures
Publication venue: DigitalCommons@USU
Publication date: 01/08/2018
Field of study

Over the last decade, General Purpose Graphics Processing Units (GPGPUs) have garnered a substantial attention in the research community due to their extensive thread-level parallelism. GPGPUs provide a remarkable performance improvement over Central Processing Units (CPUs), for highly parallel applications. However, GPGPUs typically achieve this extensive thread-level parallelism at the cost of a large power consumption. Consequently, Near-Threshold Computing (NTC) provides a promising opportunity for designing energy-efficient GPGPUs (NTC-GPUs). However, NTC-GPUs suffer from a crucial Process Variation (PV)-inflicted performance bottleneck, which is called Choke Point. Choke Point is defined as one or small group of gates which is affected by PV. Choke Point is capable of varying the path-delay of circuit and causing different forms of timing violation. In this work, a cross-layer design technique is proposed to tackle the performance impediments caused by choke points in NTC-GPUs

DigitalCommons@USU

DYRE: a DYnamic REconfigurable solution to increase GPGPU's reliability

Author: Narducci Pierpaolo
Rodriguez Condia Josie E.
Sonza Reorda Matteo
Sterpone Luca
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

General-purpose graphics processing units (GPGPUs) are extensively used in high-performance computing. However, it is well known that these devices’ reliability may be limited by the rising of faults at the hardware level. This work introduces a flexible solution to detect and mitigate permanent faults affecting the execution units in these parallel devices. The proposed solution is based on adding some spare modules to perform two in-field operations: detecting and mitigating faults. The solution takes advantage of the regularity of the execution units in the device to avoid significant design changes and reduce the overhead. The proposed solution was evaluated in terms of reliability improvement and area, performance, and power overhead costs. For this purpose, we resorted to a micro-architectural open-source GPGPU model (FlexGripPlus). Experimental results show that the proposed solution can extend the reliability by up to 57%, with overhead costs lower than 2% and 8% in area and power, respectively

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Open Access Repository