Search CORE

296 research outputs found

Fast block QR update in digital signal processing

Author: Alonso-Jordá Pedro
Alventosa Fran J.
Piñero Gema
Quintana-Ortí Enrique S.
Vidal Maciá Antonio Manuel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2019
Field of study

[EN] The processing of digital sound signals often requires the computation of the QR factorization of a rectangular system matrix. However, sometimes, only a given (and probably small) part of the system matrix varies from the current sample to the next one. We exploit this fact to reuse some computations carried out to process the former sample in order to save execution time in the processing of the current sample. These savings can be critical for real-time applications running on low power consumption devices with high mobility. In addition, we propose a simple out-of-order task-parallel algorithm for the QR factorization using OpenMP that exploits the multicore capability of modern processors. Furthermore, in the presence of a Graphics Processing Unit (GPU) in the system, our algorithm is able to off-load some tasks to the GPU to accelerate the computation on these hardware devices.This work was supported by the Spanish Ministry of Economy and Competitiveness under MINECO and FEDER projects TEC2015-67387-C4-1-R and TIN2014-53495-R; and the Generalitat Valenciana PROMETEOII/2014/003Alventosa, FJ.; Alonso-Jordá, P.; Vidal Maciá, AM.; Piñero, G.; Quintana-Ortí, ES. (2019). Fast block QR update in digital signal processing. The Journal of Supercomputing. 75(3):1051-1064. https://doi.org/10.1007/s11227-018-2298-5S10511064753Augonnet C, Thibault S, Namyst R (2010) StarPU: a runtime system for scheduling tasks over accelerator-based multicore machines. Research Report RR-7240, INRIAButtari A, Langou J, Kurzak J, Dongarra J (2008) Parallel tiled QR factorization for multicore architectures. Concurr Comput Pract Exp 20(13):1573–1590Buttari A, Langou J, Kurzak J, Dongarra J (2009) A class of parallel tiled linear algebra algorithms for multicore architectures. Parallel Comput 35(1):38–53Chan E, Quintana-Ortí ES, Quintana-Ortí G, van de Geijn R (2007) Supermatrix out-of-order scheduling of matrix operations for smp and multi-core architectures. In: Proceedings of the Nineteenth Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA ’07. ACM, New York, pp 116–125Chan E, Van Zee FG, Quintana-Ortí ES, Quintana-Ortí G, De Van Geijn R (2007) Satisfying your dependencies with supermatrix. In: Proceedings—2007 IEEE International Conference on Cluster Computing, CLUSTER 2007. pp 91–99Chan E, Van Zee FG, Bientinesi P, Quintana-Ortí ES, Quintana-Ortí G, van de Geijn RA (2008) Supermatrix: a multithreaded runtime scheduling system for algorithms-by-blocks. In: Chatterjee S, Scott ML (eds) PPOPP. ACM, New york, pp 123–132Golub GH, Van Loan CF (2013) Matrix computations. Johns Hopkins Studies in the Mathematical Sciences. Johns Hopkins University Press, BaltimoreGunter BC, van de Geijn RA (2005) Parallel out-of-core computation and updating the QR factorization. ACM Trans Math Softw 31(1):60–78Joffrain T, Quintana-Ortí ES, van de Geijn RA (2004) Rapid development of high-performance out-of-core solvers. In: Applied Parallel Computing, State of the Art in Scientific Computing, 7th International Workshop, PARA 2004, Lyngby, Denmark, June 20–23, 2004, revised selected papers. pp 413–422NVIDIA. The cuBLAS library. http://docs.nvidia.com/cuda/cublas . Accessed May 2017Openblas. http://www.openblas.net . Accessed May 2017Quintana-Ortí G, Quintana-Ortí ES, Van De Geijn RA, Van Zee FG, Chan E (2009) Programming matrix algorithms-by-blocks for thread-level parallelism. ACM Trans Math Softw 36(3):14:1–14:26The OmpSs Programming Model. https://pm.bsc.es/ompss . Accessed May 2017Wende F, Steinke T, Cordes F (2014) Multi-threaded kernel offloading to gpgpu using hyper-q on kepler architecture. Technical Report 14-19, ZIB, Takustr.7, 14195 Berli

Repositori Institucional de la Universitat Jaume I

RiuNet

Applied Parallel Computing, State of the Art in Scientific Computing, 7th International Workshop, PARA 2004, Lyngby, Denmark, June 20-23, 2004, Revised Selected Papers

Author
Publication venue: Springer Nature
Publication date: 01/01/2006
Field of study

The University of Manchester - Institutional Repository

Foundations for a new type of design-engineers – experiences from DTU meeting the CDIO concept

Author: Brodersen Søsser
Jørgensen Ulrik
Lindegaard Hanne
Publication venue: Technical University of Denmark
Publication date: 01/01/2011
Field of study

VBN

Online Research Database In Technology

Culture in Engineering Education:CDIO framing intercultural competences

Author: Christensen Hans Peter
Hoffmann Birgitte
Jørgensen Ulrik
Publication venue: Technical University of Denmark
Publication date: 01/01/2011
Field of study

VBN

Online Research Database In Technology

Enabling Technologies for Cognitive Optical Networks

Author: Borkowski Robert
Publication venue: 'American Association for Cancer Research (AACR)'
Publication date: 01/01/2014
Field of study

Online Research Database In Technology

Processing Decoded Video for LCD-LED Backlight Display:Post processing of decoded video and local backlight dimming for LCD technology with LED-based backlight

Author: Nadernejad Ehsan
Publication venue: Technical University of Denmark
Publication date: 01/01/2013
Field of study

Online Research Database In Technology

Conference proceedings 7th International CDIO Conference, Technical University of Denmark, 20th – 22nd June 2011

Author
Publication venue: Technical University of Denmark
Publication date: 01/01/2011
Field of study

Online Research Database In Technology

Nonlinear Source Emulator

Author: Nguyen-Duy Khiem
Publication venue: Technical University of Denmark, Department of Electrical Engineering
Publication date: 01/01/2015
Field of study