A general framework for efficient FPGA implementation of matrix product
Original article can be found at: http://www.medjcn.com/ Copyright Softmotor Limited.
Developers need high-performance systems to process computationally intensive applications quickly. Reconfigurable hardware devices in the form of Field-Programmable Gate Arrays (FPGAs) have been proposed as viable building blocks for constructing such systems at an economical price. Because matrix algorithms are important and widely used in scientific computing, they are ideal candidates for exploiting the advantages FPGAs offer. This paper describes a system for generating matrix algorithm cores. The system provides a catalog of efficient, user-customizable cores designed for FPGA implementation in three matrix algorithm categories: (i) matrix operations, (ii) matrix transforms and (iii) matrix decomposition. A generated core can be either general-purpose or application-specific. The paper presents the methodology used to design and implement two application-specific image processing cores. The first is a fully pipelined matrix multiplier for colour space conversion based on distributed arithmetic principles; the second is a parallel floating-point matrix multiplier designed for 3D affine transformations.
Peer reviewed
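As a minimal illustration of the two application cores mentioned in the abstract above, both operations reduce to small fixed-size matrix products. The sketch below is plain Python, not the paper's FPGA design, and it does not use distributed arithmetic. The function names are hypothetical, and the RGB-to-YCbCr coefficients are the full-range BT.601 values used by JFIF, chosen here as an assumption because the abstract does not say which colour spaces the core converts.

```python
import math

# Full-range BT.601 (JFIF) RGB -> YCbCr matrix and offset (an assumption:
# the abstract does not name the colour spaces the core targets).
RGB_TO_YCBCR = [
    [0.299, 0.587, 0.114],
    [-0.168736, -0.331264, 0.5],
    [0.5, -0.418688, -0.081312],
]
YCBCR_OFFSET = [0.0, 128.0, 128.0]


def mat_vec(matrix, vector):
    """Multiply a matrix (list of rows) by a column vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def rgb_to_ycbcr(rgb):
    """Colour space conversion as a 3x3 matrix product plus an offset."""
    return [p + o for p, o in zip(mat_vec(RGB_TO_YCBCR, rgb), YCBCR_OFFSET)]


def affine_3d(rotation_z_rad, translation):
    """Build a 4x4 homogeneous matrix: rotate about z, then translate."""
    c, s = math.cos(rotation_z_rad), math.sin(rotation_z_rad)
    tx, ty, tz = translation
    return [
        [c, -s, 0.0, tx],
        [s, c, 0.0, ty],
        [0.0, 0.0, 1.0, tz],
        [0.0, 0.0, 0.0, 1.0],
    ]


def transform_point(matrix, point):
    """Apply a 4x4 affine matrix to a 3D point via homogeneous coordinates."""
    x, y, z, _ = mat_vec(matrix, [point[0], point[1], point[2], 1.0])
    return [x, y, z]
```

In hardware, each row of such a product maps naturally onto a set of multiply-accumulate units working in parallel, which is why small fixed-size matrix multipliers of this kind are a common target for FPGA cores.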
TANGO: Transparent heterogeneous hardware Architecture deployment for eNergy Gain in Operation
The paper is concerned with the issue of how software systems actually use
Heterogeneous Parallel Architectures (HPAs), with the goal of optimizing power
consumption on these resources. It argues the need for novel methods and tools
to support software developers aiming to optimise power consumption resulting
from designing, developing, deploying and running software on HPAs, while
maintaining other quality aspects of software to adequate and agreed levels. To
do so, a reference architecture to support energy efficiency at application
construction, deployment, and operation is discussed, as well as its
implementation and evaluation plans.
Comment: Part of the Program Transformation for Programmability in
Heterogeneous Architectures (PROHA) workshop, Barcelona, Spain, 12th March
2016, 7 pages, LaTeX, 3 PNG figures
Modern middleware for the data acquisition of the Cherenkov Telescope Array
The data acquisition system (DAQ) of the future Cherenkov Telescope Array
(CTA) must be efficient, modular and robust to be able to cope with the very
large data rate of up to 550 Gbps coming from many telescopes with different
characteristics. The use of modern middleware, namely ZeroMQ and Protocol
Buffers, can help to achieve these goals while keeping the development effort
to a reasonable level. Protocol Buffers are used as an on-line data format,
while ZeroMQ is employed to communicate between processes. The DAQ will be
controlled and monitored by the ALMA Common Software (ACS). Protocol Buffers
from Google are a way to define high-level data structures through an
interface description language (IDL) and a meta-compiler. ZeroMQ is a middleware
that augments the capabilities of TCP/IP sockets. It does not implement very
high-level features like those found in CORBA for example, but makes use of
sockets easier, more robust and almost as effective as raw TCP. The use of
these two middlewares enabled us to rapidly develop a robust prototype of the
DAQ including data persistence to compressed FITS files.
Comment: In Proceedings of the 34th International Cosmic Ray Conference
(ICRC2015), The Hague, The Netherlands. All CTA contributions at
arXiv:1508.0589
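To make the division of labour described in the abstract above more concrete, the sketch below shows the bookkeeping an application has to do itself when it sends structured records over a raw stream socket: serialise each record and frame it with a length prefix. It uses only the Python standard library as a stand-in. It does not use ZeroMQ or Protocol Buffers, and the record layout (`event_id`, `telescope_id`, a list of samples) is purely hypothetical, not the CTA data format.

```python
import socket
import struct

# Hypothetical record header: event_id (u64), telescope_id (u16), sample count (u32).
HEADER = struct.Struct("!QHI")
# Each frame on the wire starts with a 4-byte big-endian payload length.
LENGTH = struct.Struct("!I")


def encode_event(event_id, telescope_id, samples):
    """Serialise one hypothetical event record to bytes."""
    body = struct.pack("!%dH" % len(samples), *samples)
    return HEADER.pack(event_id, telescope_id, len(samples)) + body


def decode_event(payload):
    """Inverse of encode_event: bytes back to (event_id, telescope_id, samples)."""
    event_id, telescope_id, count = HEADER.unpack_from(payload, 0)
    samples = list(struct.unpack_from("!%dH" % count, payload, HEADER.size))
    return event_id, telescope_id, samples


def send_frame(sock, payload):
    """Write one length-prefixed frame to a stream socket."""
    sock.sendall(LENGTH.pack(len(payload)) + payload)


def recv_exact(sock, size):
    """Read exactly `size` bytes; a stream socket may return fewer per call."""
    chunks = []
    while size > 0:
        chunk = sock.recv(size)
        if not chunk:
            raise ConnectionError("peer closed the connection mid-frame")
        chunks.append(chunk)
        size -= len(chunk)
    return b"".join(chunks)


def recv_frame(sock):
    """Read one length-prefixed frame and return its payload."""
    (size,) = LENGTH.unpack(recv_exact(sock, LENGTH.size))
    return recv_exact(sock, size)
```

With message-oriented middleware and an IDL-generated serialiser, the hand-written `struct` layout and the `send_frame`/`recv_frame` helpers would no longer need to be maintained by the application.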
Status and Plans for the Array Control and Data Acquisition System of the Cherenkov Telescope Array
The Cherenkov Telescope Array (CTA) is the next-generation atmospheric
Cherenkov gamma-ray observatory. CTA will consist of two installations, one in
the northern, and the other in the southern hemisphere, containing tens of
telescopes of different sizes. The CTA performance requirements and the
inherent complexity associated with the operation, control and monitoring of
such a large distributed multi-telescope array lead to new challenges in the
field of gamma-ray astronomy. The ACTL (array control and data acquisition)
system will consist of the hardware and software that is necessary to control
and monitor the CTA arrays, as well as to time-stamp, read out, filter and
store the scientific data at aggregated rates of a few GB/s. The ACTL system
must be flexible enough to permit the simultaneous automatic operation of
multiple sub-arrays of telescopes with minimal on-site personnel effort. One
of the challenges of the system is to provide a reliable integration of the
control of a large and heterogeneous set of devices. Moreover, the system is
required to be ready to adapt the observation schedule, on timescales of a few
tens of seconds, to account for changing environmental conditions or to
prioritize incoming scientific alerts from time-critical transient phenomena
such as gamma-ray bursts. This contribution provides a summary of the main
design choices and plans for building the ACTL system.
Comment: In Proceedings of the 34th International Cosmic Ray Conference
(ICRC2015), The Hague, The Netherlands. All CTA contributions at
arXiv:1508.0589