Search CORE

1,271 research outputs found

Towards Lattice Quantum Chromodynamics on FPGA devices

Author: Korcyl Grzegorz
Korcyl Piotr
Publication venue: 'Elsevier BV'
Publication date: 04/12/2019
Field of study

In this paper we describe a single-node, double precision Field Programmable Gate Array (FPGA) implementation of the Conjugate Gradient algorithm in the context of Lattice Quantum Chromodynamics. As a benchmark of our proposal we invert numerically the Dirac-Wilson operator on a 4-dimensional grid on three Xilinx hardware solutions: Zynq Ultrascale+ evaluation board, the Alveo U250 accelerator and the largest device available on the market, the VU13P device. In our implementation we separate software/hardware parts in such a way that the entire multiplication by the Dirac operator is performed in hardware, and the rest of the algorithm runs on the host. We find out that the FPGA implementation can offer a performance comparable with that obtained using current CPU or Intel's many core Xeon Phi accelerators. A possible multiple node FPGA-based system is discussed and we argue that power-efficient High Performance Computing (HPC) systems can be implemented using FPGA devices only.Comment: 17 pages, 4 figure

arXiv.org e-Print Archive

Jagiellonian Univeristy Repository

Evaluation of Single-Chip, Real-Time Tomographic Data Processing on FPGA - SoC Devices

Author: Białas P.
Curceanu C.
Czerwiński E.
Dulski K.
Flak B.
Gajos A.
Gorgol M.
Głowacz B.
Hiesmayr B. C.
Jasińska B.
Kacprzak K.
Kajetanowicz M.
Kisielewska D.
Korcyl G.
Kowalski P.
Kozik T.
Krawczyk N.
Krzemień W.
Kubicz E.
Mohammed M.
Moskal P.
Niedźwiecki Sz.
Pawlik-Niedźwiecka M.
Pałka M.
Raczyński L.
Rajda P.
Rudy Z.
Salabura P.
Sharma N. G.
Sharma S.
Shopa R. Y.
Silarski M.
Skurzok M.
Strzempek P.
Wieczorek A.
Wiślicki W.
Zaleski R.
Zgardzińska B.
Zieliński M.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

A novel approach to tomographic data processing has been developed and evaluated using the Jagiellonian PET (J-PET) scanner as an example. We propose a system in which there is no need for powerful, local to the scanner processing facility, capable to reconstruct images on the fly. Instead we introduce a Field Programmable Gate Array (FPGA) System-on-Chip (SoC) platform connected directly to data streams coming from the scanner, which can perform event building, filtering, coincidence search and Region-Of-Response (ROR) reconstruction by the programmable logic and visualization by the integrated processors. The platform significantly reduces data volume converting raw data to a list-mode representation, while generating visualization on the fly.Comment: IEEE Transactions on Medical Imaging, 17 May 201

arXiv.org e-Print Archive

Jagiellonian Univeristy Repository

Using System-on-a-Programmable-Chip Technology to Design Embedded Systems

Author: Hall Tyson S.
Hamblen James O.
Publication venue: 'IEREK Research Enrichment and Knowledge Exchange'
Publication date: 01/09/2006
Field of study

This paper describes the tools, techniques, and devices used to design embedded products with system–on-a-chip (SoC) type solutions using a large Field Programmable Gate Array (FPGA) with an internal processor core. This new FPGA-based approach is called system-on-a-programmable-chip (SoPC ). The performance tradeoffs present in SoPC systems is compared to more traditional design approaches. Commercial devices, processor cores, and CAD tool flows are described. The issues in SoPC hardware/software design tradeoffs are examined and three example SoPC designs are presented as case studies

A cross-platform OpenVX library for FPGA accelerators

Author: Davila-Guzman M. A.
Gran Tejero R.
Göhringer D.
Kalms L.
Suarez Gracia D.
Villarroya-Gaudo M.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2022
Field of study

FPGAs are an excellent platform to implement computer vision applications, since these applications tend to offer a high level of parallelism with many data-independent operations. However, the freedom in the solution design space of FPGAs represents a problem because each solution must be individually designed, verified, and tuned. The emergence of High Level Synthesis (HLS) helps solving this problem and has allowed the implementation of open programming standards as OpenVX for computer vision applications on FPGAs, such as the HiF1ipVX library developed exclusively for Xilinx devices. Although with the HiF1ipVX library, designers can develop solutions efficiently on Xilinx, they do not have an approach to port and run their code on FPGAs from other manufacturers. This work extends the HiFlipVX capabilities in two significant ways: supporting Intel FPGA devices and enabling execution on discrete FPGA accelerators. To provide both without affecting user-facing code, the new carried out implementation combines two HLS programming models: C++, using Intel''s system of tasks, and OpenCL, which provides the CPU interoperability. Comparing with pure OpenCL implementations, this work reduces kernel dispatch resources, saving up to 24% of ALUT resources for each kernel in a graph, and improves performance 2.6 x and energy consumption 1.6 x on average for a set of representative applications, compared with state-of-the-art frameworks

FOS: A Modular FPGA Operating System for Dynamic Workloads

Author: Koch Dirk
Pham Khoa
Powell Joseph
Vaishnav Anuj
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 26/01/2020
Field of study

With FPGAs now being deployed in the cloud and at the edge, there is a need for scalable design methods which can incorporate the heterogeneity present in the hardware and software components of FPGA systems. Moreover, these FPGA systems need to be maintainable and adaptable to changing workloads while improving accessibility for the application developers. However, current FPGA systems fail to achieve modularity and support for multi-tenancy due to dependencies between system components and lack of standardised abstraction layers. To solve this, we introduce a modular FPGA operating system -- FOS, which adopts a modular FPGA development flow to allow each system component to be changed and be agnostic to the heterogeneity of EDA tool versions, hardware and software layers. Further, to dynamically maximise the utilisation transparently from the users, FOS employs resource-elastic scheduling to arbitrate the FPGA resources in both time and spatial domain for any type of accelerators. Our evaluation on different FPGA boards shows that FOS can provide performance improvements in both single-tenant and multi-tenant environments while substantially reducing the development time and, at the same time, improving flexibility

arXiv.org e-Print Archive