5 research outputs found

    Implementation of the conjugate gradient algorithm on FPGA devices

    Get PDF
    Results of porting parts of the Lattice Quantum Chromodynamics code to modern FPGA devices are presented. A single-node, double precision implementation of the Conjugate Gradient algorithm is used to invert numerically the Dirac-Wilson operator on a 4-dimensional grid on a Xilinx Zynq evaluation board. The code is divided into two software/hardware parts in such a way that the entire multiplication by the Dirac operator is performed in programmable logic, and the rest of the algorithm runs on the ARM cores. Optimized data blocks are used to efficiently use data movement infrastructure allowing to reach intervals of 1 clock cycle. We show that the FPGA implementation can offer a comparable performance compared to that obtained using Intel Xeon Phi KNL.Comment: Proceedings of the 36th Annual International Symposium on Lattice Field Theory - LATTICE201

    Towards Lattice Quantum Chromodynamics on FPGA devices

    Get PDF
    In this paper we describe a single-node, double precision Field Programmable Gate Array (FPGA) implementation of the Conjugate Gradient algorithm in the context of Lattice Quantum Chromodynamics. As a benchmark of our proposal we invert numerically the Dirac-Wilson operator on a 4-dimensional grid on three Xilinx hardware solutions: Zynq Ultrascale+ evaluation board, the Alveo U250 accelerator and the largest device available on the market, the VU13P device. In our implementation we separate software/hardware parts in such a way that the entire multiplication by the Dirac operator is performed in hardware, and the rest of the algorithm runs on the host. We find out that the FPGA implementation can offer a performance comparable with that obtained using current CPU or Intel's many core Xeon Phi accelerators. A possible multiple node FPGA-based system is discussed and we argue that power-efficient High Performance Computing (HPC) systems can be implemented using FPGA devices only.Comment: 17 pages, 4 figure

    Implementation of the conjugate gradient algorithm in Lattice QCD on FPGA devices

    Get PDF
    Results of porting parts of the Lattice Quantum Chromodynamics code to modern FPGA devices are presented. A single-node, double precision implementation of the Conjugate Gradient algorithm is used to invert numerically the Dirac-Wilson operator on a 4-dimensional grid on a Xilinx Zynq evaluation board. The code is divided into two software/hardware parts in such a way that the entire multiplication by the Dirac operator is performed in programmable logic, and the rest of the algorithm runs on the ARM cores. Optimized data blocks are used to efficiently use data movement infrastructure allowing to reach intervals of 1 clock cycle. We show that the FPGA implementation can offer a comparable performance compared to that obtained using Intel Xeon Phi KN

    FPGA-based range-limited molecular dynamics acceleration

    Full text link
    Molecular Dynamics (MD) is a computer simulation technique that executes iteratively over discrete, infinitesimal time intervals. It has been a widely utilized application in the fields of material sciences and computer-aided drug design for many years, serving as a crucial benchmark in high-performance computing (HPC). Numerous MD packages have been developed and effectively accelerated using GPUs. However, as the limits of Moore's Law are reached, the performance of an individual computing node has reached its bottleneck, while the performance of multiple nodes is primarily hindered by scalability issues, particularly when dealing with small datasets. In this thesis, the acceleration with respect to small datasets is the main focus. With the recent COVID-19 pandemic, drug discovery has gained significant attention, and Molecular Dynamics (MD) has emerged as a crucial tool in this process. Particularly, in the critical domain of drug discovery, small simulations involving approximately ~50K particles are frequently employed. However, it is important to note that small simulations do not necessarily translate to faster results, as long-term simulations comprising billions of MD iterations and more are essential in this context. In addition to dataset size, the problem of interest is further constrained. Referred to as the most computationally demanding aspect of MD, the evaluation of range-limited (RL) forces not only accounts for 90% of the MD computation workload but also involves irregular mapping patterns of 3-D data onto 2-D processor networks. To emphasize, this thesis centers around the acceleration of RL MD specifically for small datasets. In order to address the single-node bottleneck and multi-node scaling challenges, the thesis is organized into two progressive stages of investigation. The first stage delves extensively into enhancing single-node efficiency by examining various factors such as workload mapping from 3-D to 2-D, data routing, and data locality. The second stage focuses on studying multi-node scalability, with a particular emphasis on strong scaling, bandwidth demands, and the synchronization mechanisms between nodes. Through our study, the results show our design on a Xilinx U280 FPGA achieves 51.72x and 4.17x speedups with respect to an Intel Xeon Gold 6226R CPU, and a Quadro RTX 8000 GPU. Our research towards strong scaling also demonstrates that 8 Xilinx U280 FPGAs connected to a switch achieves 4.67x speedup compared to an Nvidia V100 GP

    Polymerization reactions and modifications of polymers by ionizing radiation

    Get PDF
    Ionizing radiation has become the most effective way to modify natural and synthetic polymers through crosslinking, degradation, and graft polymerization. This review will include an in-depth analysis of radiation chemistry mechanisms and the kinetics of the radiation-induced C-centered free radical, anion, and cation polymerization, and grafting. It also presents sections on radiation modifications of synthetic and natural polymers. For decades, low linear energy transfer (LLET) ionizing radiation, such as gamma rays, X-rays, and up to 10 MeV electron beams, has been the primary tool to produce many products through polymerization reactions. Photons and electrons interaction with polymers display various mechanisms. While the interactions of gamma ray and X-ray photons are mainly through the photoelectric effect, Compton scattering, and pair-production, the interactions of the high-energy electrons take place through coulombic interactions. Despite the type of radiation used on materials, photons or high energy electrons, in both cases ions and electrons are produced. The interactions between electrons and monomers takes place within less than a nanosecond. Depending on the dose rate (dose is defined as the absorbed radiation energy per unit mass), the kinetic chain length of the propagation can be controlled, hence allowing for some control over the degree of polymerization. When polymers are submitted to high-energy radiation in the bulk, contrasting behaviors are observed with a dominant effect of cross-linking or chain scission, depending on the chemical nature and physical characteristics of the material. Polymers in solution are subject to indirect effects resulting from the radiolysis of the medium. Likewise, for radiation-induced polymerization, depending on the dose rate, the free radicals generated on polymer chains can undergo various reactions, such as inter/intramolecular combination or inter/intramolecular disproportionation, b-scission. These reactions lead to structural or functional polymer modifications. In the presence of oxygen, playing on irradiation dose-rates, one can favor crosslinking reactions or promotes degradations through oxidations. The competition between the crosslinking reactions of C-centered free radicals and their reactions with oxygen is described through fundamental mechanism formalisms. The fundamentals of polymerization reactions are herein presented to meet industrial needs for various polymer materials produced or degraded by irradiation. Notably, the medical and industrial applications of polymers are endless and thus it is vital to investigate the effects of sterilization dose and dose rate on various polymers and copolymers with different molecular structures and morphologies. The presence or absence of various functional groups, degree of crystallinity, irradiation temperature, etc. all greatly affect the radiation chemistry of the irradiated polymers. Over the past decade, grafting new chemical functionalities on solid polymers by radiation-induced polymerization (also called RIG for Radiation-Induced Grafting) has been widely exploited to develop innovative materials in coherence with actual societal expectations. These novel materials respond not only to health emergencies but also to carbon-free energy needs (e.g., hydrogen fuel cells, piezoelectricity, etc.) and environmental concerns with the development of numerous specific adsorbents of chemical hazards and pollutants. The modification of polymers through RIG is durable as it covalently bonds the functional monomers. As radiation penetration depths can be varied, this technique can be used to modify polymer surface or bulk. The many parameters influencing RIG that control the yield of the grafting process are discussed in this review. These include monomer reactivity, irradiation dose, solvent, presence of inhibitor of homopolymerization, grafting temperature, etc. Today, the general knowledge of RIG can be applied to any solid polymer and may predict, to some extent, the grafting location. A special focus is on how ionizing radiation sources (ion and electron beams, UVs) may be chosen or mixed to combine both solid polymer nanostructuration and RIG. LLET ionizing radiation has also been extensively used to synthesize hydrogel and nanogel for drug delivery systems and other advanced applications. In particular, nanogels can either be produced by radiation-induced polymerization and simultaneous crosslinking of hydrophilic monomers in “nanocompartments”, i.e., within the aqueous phase of inverse micelles, or by intramolecular crosslinking of suitable water-soluble polymers. The radiolytically produced oxidizing species from water, •OH radicals, can easily abstract H-atoms from the backbone of the dissolved polymers (or can add to the unsaturated bonds) leading to the formation of C-centered radicals. These C-centered free radicals can undergo two main competitive reactions; intramolecular and intermolecular crosslinking. When produced by electron beam irradiation, higher temperatures, dose rates within the pulse, and pulse repetition rates favour intramolecular crosslinking over intermolecular crosslinking, thus enabling a better control of particle size and size distribution. For other water-soluble biopolymers such as polysaccharides, proteins, DNA and RNA, the abstraction of H atoms or the addition to the unsaturation by •OH can lead to the direct scission of the backbone, double, or single strand breaks of these polymers
    corecore