Search CORE

4 research outputs found

An efficient GPU implementation of fixed-complexity sphere decoders for MIMO wireless systems

Author: Almenar Terré Vicenç
González Salvador Alberto
Ramiro Sánchez Carla
Roger Varea Sandra
Vidal Maciá Antonio Manuel
Publication venue: 'IOS Press'
Publication date: 01/01/2012
Field of study

The use of many-core processors such as general purpose Graphic Processing Units (GPUs) has recently become attractive for the efficient implementation of signal processing algorithms for communication systems. This is due to the cost-effectiveness of GPUs together with their potential capability of parallel processing. This paper presents an implementation of the widely employed fixed-complexity sphere decoder on GPUs, which allows to considerably decrease the computational time required for the data detection stage in multiple-input multiple-output systems. Both, the hard-and soft-output versions of the method have been implemented. Speedup results show the proposed GPU implementation boosts the runtime of the parallel execution of the methods in a high performance multi-core CPU. In addition, the throughput of the algorithm is evaluated and is shown to outperform other recent implementations and to fulfill the real-time requirements of several LTE configurations. ©2012-IOS Press and the authors. All rights reserved.This work was partially funded by the TEC2009-13741 project of the Spanish Ministry of Science and by the PROMETEO/2009/013 project of the Generalitat Valenciana.Roger Varea, S.; Ramiro Sánchez, C.; González Salvador, A.; Almenar Terré, V.; Vidal Maciá, AM. (2012). An efficient GPU implementation of fixed-complexity sphere decoders for MIMO wireless systems. Integrated Computer-Aided Engineering. 19(4):341-350. https://doi.org/10.3233/ICA-2012-0410S34135019

Crossref

RiuNet

Parallel Implementation of a Real-Time High Dynamic Range Video System

Author: Effelsberg Wolfgang
Guthier Benjamin
Kopf Stephan
Wichtlhuber Matthias
Publication venue: 'IOS Press'
Publication date: 01/01/2014
Field of study

Abstract. This article describes the use of the parallel processing capabilities of a graphics chip to increase the processing speed of a high dynamic range (HDR) video system. The basis is an existing HDR video system that produces each frame from a sequence of regular images taken in quick succession under varying exposure settings. The image sequence is processed in a pipeline consisting of: shutter speeds selection, capturing, color space conversion, image registration, HDR stitching, and tone mapping. This article identifies bottlenecks in the pipeline and describes modifications to the algorithms that are necessary to enable parallel processing. Time-critical steps are processed on a graphics processing unit (GPU). The resulting processing time is evaluated and compared to the original sequential code. The creation of an HDR video frame is sped up by a factor of 15 on the average

CiteSeerX

MAnnheim DOCument Server

A low-cost 3D human interface device using GPU-based optical flow algorithms

Author
Publication venue: 'IOS Press'
Publication date
Field of study

Crossref