1,464 research outputs found
High-Performance Simulations of Coherent Synchrotron Radiation on Multicore GPU and CPU Platforms
Coherent synchrotron radiation (CSR) is an effect of the self-interaction of an electron bunch as it traverses a curved path. It can cause significant emittance degradation and microbunching. We present a new high-performance 2D particle-in-cell code which uses massively parallel multicore GPU/CPU platforms to alleviate computational bottlenecks. The code formulates the CSR problem from first principles, using the retarded scalar and vector potentials to compute the self-interaction fields. The speedup due to the parallel implementation on GPU/CPU platforms exceeds three orders of magnitude, bringing a previously intractable problem within reach. The accuracy of the code is verified against analytic 1D solutions (rigid bunch).
Towards quantum simulation with circular Rydberg atoms
The main objective of quantum simulation is an in-depth understanding of
many-body physics. It is important for fundamental issues (quantum phase
transitions, transport, ...) and for the development of innovative
materials. Analytic approaches to many-body systems are limited and the huge
size of their Hilbert space makes numerical simulations on classical computers
intractable. A quantum simulator avoids these limitations by transcribing the
system of interest into another, with the same dynamics but with interaction
parameters under control and with experimental access to all relevant
observables. Quantum simulation of spin systems is being explored with trapped
ions, neutral atoms and superconducting devices. We propose here a new paradigm
for quantum simulation of spin-1/2 arrays providing unprecedented flexibility
and allowing one to explore domains beyond the reach of other platforms. It is
based on laser-trapped circular Rydberg atoms. Their long intrinsic lifetimes
combined with the inhibition of their microwave spontaneous emission and their
low sensitivity to collisions and photoionization make trapping lifetimes in
the minute range realistic with state-of-the-art techniques. Ultra-cold
defect-free circular atom chains can be prepared by a variant of the
evaporative cooling method. This method also leads to the individual detection
of arbitrary spin observables. The proposed simulator realizes an XXZ spin-1/2
Hamiltonian with nearest-neighbor couplings ranging from a few to tens of kHz.
All the model parameters can be tuned at will, making a large range of
simulations accessible. The system evolution can be followed over times in the
range of seconds, long enough to be relevant for ground-state adiabatic
preparation and for the study of thermalization, disorder or Floquet time
crystals. This platform presents unrivaled features for quantum simulation.
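As a reference for the model mentioned above, a standard form of the XXZ spin-1/2 Hamiltonian with nearest-neighbor couplings is (the symbols J and Δ are generic textbook notation; the abstract does not fix a convention):

```latex
H \;=\; \frac{J}{2} \sum_{j} \left( \sigma^{x}_{j}\sigma^{x}_{j+1} + \sigma^{y}_{j}\sigma^{y}_{j+1} \right)
\;+\; \frac{J\Delta}{2} \sum_{j} \sigma^{z}_{j}\sigma^{z}_{j+1}
```

Here J sets the nearest-neighbor coupling strength (a few to tens of kHz in the proposed platform, per the abstract) and Δ is the tunable anisotropy between the in-plane exchange and the Ising term.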
Simulations of Coherent Synchrotron Radiation on Parallel Hybrid GPU/CPU Platform
Coherent synchrotron radiation (CSR) is an effect of the self-interaction of an electron bunch as it traverses a curved path. It can cause significant emittance degradation, as well as fragmentation and microbunching. Numerical simulations of 2D/3D CSR effects have been extremely challenging due to computational bottlenecks associated with calculating retarded potentials via integration over the history of the bunch. We present a new high-performance 2D particle-in-cell code which uses massively parallel multicore GPU/CPU platforms to alleviate these bottlenecks. The code formulates the CSR problem from first principles, using the retarded scalar and vector potentials to compute the self-interaction fields. The speedup due to the parallel implementation on GPU/CPU platforms exceeds three orders of magnitude, bringing a previously intractable problem within reach. The accuracy of the code is verified against analytic 1D solutions (rigid bunch) and semi-analytic 2D solutions for the chirped bunch. Finally, we use the new code in conjunction with a genetic algorithm to optimize the design of a fiducial chicane.
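The core numerical task this abstract describes, evaluating retarded potentials by looking back along the particle history, can be illustrated with a minimal sketch. This is an illustrative single-particle Liénard-Wiechert evaluation with a bisection solve for the retarded time, not the paper's actual algorithm (a real PIC code evaluates such integrals for many macroparticles on a grid); all function names here are hypothetical:

```python
import math

C = 299_792_458.0        # speed of light [m/s]
EPS0 = 8.8541878128e-12  # vacuum permittivity [F/m]

def retarded_time(r_obs, t_obs, traj, t_lo, tol=1e-18):
    """Solve the retardation condition |r_obs - traj(t)| = c*(t_obs - t)
    for t in [t_lo, t_obs] by bisection (the residual is monotone for v < c).
    traj(t) returns the particle position (x, y) at time t."""
    def f(t):
        x, y = traj(t)
        return math.hypot(r_obs[0] - x, r_obs[1] - y) - C * (t_obs - t)
    lo, hi = t_lo, t_obs
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(lo) * f(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

def lw_scalar_potential(r_obs, t_obs, q, traj, t_lo, dt=1e-15):
    """Lienard-Wiechert scalar potential of one point charge:
    phi = q / (4*pi*eps0 * (R - R_vec . beta)), evaluated at the retarded time."""
    tr = retarded_time(r_obs, t_obs, traj, t_lo)
    x, y = traj(tr)
    # velocity (in units of c) by central finite difference on the trajectory
    x1, y1 = traj(tr - dt)
    x2, y2 = traj(tr + dt)
    bx, by = (x2 - x1) / (2 * dt * C), (y2 - y1) / (2 * dt * C)
    Rx, Ry = r_obs[0] - x, r_obs[1] - y
    R = math.hypot(Rx, Ry)
    return q / (4 * math.pi * EPS0 * (R - (Rx * bx + Ry * by)))

# Sanity check: a charge at rest reproduces the static Coulomb potential,
# and the retarded time at distance 1 m is -1/c.
phi = lw_scalar_potential((1.0, 0.0), 0.0, 1.6e-19, lambda t: (0.0, 0.0), -1e-6)
# phi ~ 1.44e-9 V, the Coulomb potential of an elementary charge at 1 m
```

The history integration the abstract refers to is exactly why CSR codes are expensive: every field evaluation requires a root solve like `retarded_time` into stored past trajectories, which is what the GPU/CPU parallelization targets.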
Efficient Machine Learning Approach for Optimizing Scientific Computing Applications on Emerging HPC Architectures
Efficient parallel implementation of scientific applications on multi-core CPUs with accelerators such as GPUs and Xeon Phis is challenging. It requires exploiting the data-parallel architecture of the accelerator along with the vector pipelines of modern x86 CPU architectures, load balancing, and efficient memory transfer between different devices. It is relatively easy to meet these requirements for highly structured scientific applications. In contrast, a number of scientific and engineering applications are unstructured. Getting performance on accelerators for these applications is extremely challenging because many of them employ irregular algorithms which exhibit data-dependent control flow and irregular memory accesses. Furthermore, these applications are often iterative with dependencies between steps, making them hard to parallelize across steps. As a result, parallelism in these applications is often limited to a single step. Numerical simulation of charged particle beam dynamics is one such application, where the distribution of work and the memory access pattern at each time step are irregular. Applications with these properties tend to exhibit significant branch and memory divergence, load imbalance between processor cores, and poor compute and memory utilization. Prior research on parallelizing such irregular applications has focused on optimizing the irregular, data-dependent memory accesses and control flow during a single step of the application, independent of the other steps, under the assumption that these patterns are completely unpredictable. We observed that the structure of computation leading to control-flow divergence and irregular memory accesses in one step is similar to that in the next step, so this structure can be predicted in the current step by observing the computation structure of previous steps.
In this dissertation, we present novel machine-learning-based optimization techniques to address the parallel implementation challenges of such irregular applications on different HPC architectures. In particular, we use supervised learning to predict the computation structure and use it to address the control-flow and memory-access irregularities in the parallel implementation of such applications on GPUs, Xeon Phis, and heterogeneous architectures composed of multi-core CPUs with GPUs or Xeon Phis. We use numerical simulation of charged particle beam dynamics as a motivating example throughout the dissertation, though the techniques should be equally applicable to a wide range of irregular applications. The machine learning approach presented here uses predictive analytics and forecasting techniques to adaptively model and track the irregular memory access pattern at each time step of the simulation and anticipate the future access pattern. Access-pattern forecasts can then be used to make optimization decisions during application execution, improving performance at a future time step based on observations from earlier time steps. In heterogeneous architectures, forecasts can also be used to improve the memory performance and resource utilization of all the processing units to deliver good aggregate performance. We used these optimization techniques and this anticipation strategy to design a cache-aware, memory-efficient parallel algorithm that addresses the irregularities in the parallel implementation of charged particle beam dynamics simulation on different HPC architectures. Experimental results on a diverse mix of HPC architectures show that our anticipation strategy is effective in maximizing data reuse, ensuring workload balance, minimizing branch and memory divergence, and improving resource utilization.
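The anticipation idea described above, forecasting the next step's access pattern from previous steps and using it to schedule work for locality, can be sketched as follows. This is a minimal illustration with an exponentially weighted forecast and a hot-bin-first particle ordering; the function names and the specific forecaster are illustrative stand-ins, not the dissertation's actual models:

```python
import numpy as np

def forecast_counts(history, alpha=0.5):
    """Exponentially weighted forecast of per-bin particle counts for the
    next time step, from counts observed at previous steps (a simple
    stand-in for the predictive models described above)."""
    f = np.asarray(history[0], dtype=float)
    for h in history[1:]:
        f = alpha * np.asarray(h, dtype=float) + (1.0 - alpha) * f
    return f

def locality_order(positions, n_bins, forecast):
    """Reorder particle indices so that particles in predicted-hot bins are
    processed together (stable within each bin), improving cache reuse and
    reducing branch/memory divergence in the next step."""
    positions = np.asarray(positions)
    bin_ids = np.minimum((positions * n_bins).astype(int), n_bins - 1)
    rank = np.empty(n_bins, dtype=int)
    rank[np.argsort(-forecast, kind="stable")] = np.arange(n_bins)
    return np.argsort(rank[bin_ids], kind="stable")

# Example: per-bin occupancy observed over two steps; four particles in [0, 1).
f = forecast_counts([[3, 1, 0], [4, 2, 0]])          # -> [3.5, 1.5, 0.0]
order = locality_order([0.9, 0.1, 0.5, 0.15], 3, f)  # hottest predicted bin first
```

The reordering cost is amortized because, as the dissertation observes, the pattern changes slowly between steps, so the forecast (and the ordering derived from it) stays useful across several iterations.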
Neural Networks for Modeling and Control of Particle Accelerators
We describe some of the challenges of particle accelerator control, highlight
recent advances in neural network techniques, discuss some promising avenues
for incorporating neural networks into particle accelerator control systems,
and describe a neural network-based control system that is being developed for
resonance control of an RF electron gun at the Fermilab Accelerator Science and
Technology (FAST) facility, including initial experimental results from a
benchmark controller.
Technologies for trapped-ion quantum information systems
Scaling-up from prototype systems to dense arrays of ions on chip, or vast
networks of ions connected by photonic channels, will require developing
entirely new technologies that combine miniaturized ion trapping systems with
devices to capture, transmit and detect light, while refining how ions are
confined and controlled. Building a cohesive ion system from such diverse parts
involves many challenges, including navigating materials incompatibilities and
undesired coupling between elements. Here, we review our recent efforts to
create scalable ion systems incorporating unconventional materials such as
graphene and indium tin oxide, integrating devices like optical fibers and
mirrors, and exploring alternative ion loading and trapping techniques.
Coupled Dynamics of Spin Qubits in Optical Dipole Microtraps: Application to the Error Analysis of a Rydberg-Blockade Gate
Single atoms in dipole microtraps or optical tweezers have recently become a promising platform for quantum computing and simulation. Here we report a detailed theoretical analysis of the physics underlying an implementation of a Rydberg two-qubit gate in such a system, a cornerstone protocol in quantum computing with single atoms. We focus on a blockade-type entangling gate and consider the various decoherence processes limiting its performance in a real system. We provide numerical estimates for the limits on the fidelity of the maximally entangled states and predict the full process matrix corresponding to the noisy two-qubit gate. We consider different excitation geometries and show certain advantages of realizing the gate with linearly polarized driving beams. Our methods and results may find application in numerical models for the simulation and optimization of neutral-atom quantum processors.
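For reference, the "process matrix" mentioned above is the χ-matrix of the standard quantum process tomography representation, in which the noisy gate acts on an input state ρ as (a textbook definition, not notation specific to this paper):

```latex
\mathcal{E}(\rho) \;=\; \sum_{m,n} \chi_{mn}\, E_m \,\rho\, E_n^{\dagger}
```

where {E_m} is a fixed operator basis (for two qubits, typically the 16 two-qubit Pauli operators); the quality of the implemented gate can then be assessed by comparing χ against the ideal gate's process matrix.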
ASCR/HEP Exascale Requirements Review Report
This draft report summarizes and details the findings, results, and
recommendations derived from the ASCR/HEP Exascale Requirements Review meeting
held in June 2015. The main conclusions are as follows. 1) Larger, more
capable computing and data facilities are needed to support HEP science goals
in all three frontiers: Energy, Intensity, and Cosmic. The expected scale of
the demand at the 2025 timescale is at least two orders of magnitude greater
-- and in some cases more -- than what is currently available.
produced by simulations is overwhelming the current ability, of both facilities
and researchers, to store and analyze it. Additional resources and new
techniques for data analysis are urgently needed. 3) Data rates and volumes
from HEP experimental facilities are also straining the ability to store and
analyze large and complex data volumes. Appropriately configured
leadership-class facilities can play a transformational role in enabling
scientific discovery from these datasets. 4) A close integration of HPC
simulation and data analysis will aid greatly in interpreting results from HEP
experiments. Such an integration will minimize data movement and facilitate
interdependent workflows. 5) Long-range planning between HEP and ASCR will be
required to meet HEP's research needs. To best use ASCR HPC resources the
experimental HEP program needs a) an established long-term plan for access to
ASCR computational and data resources, b) an ability to map workflows onto HPC
resources, c) the ability for ASCR facilities to accommodate workflows run by
collaborations that can have thousands of individual members, d) to transition
codes to the next-generation HPC platforms that will be available at ASCR
facilities, e) to build up and train a workforce capable of developing and
using simulations and analysis to support HEP scientific research on
next-generation systems.