408 research outputs found
Multi-GPU Acceleration of the iPIC3D Implicit Particle-in-Cell Code
iPIC3D is a widely used massively parallel Particle-in-Cell code for the
simulation of space plasmas. However, its current implementation does not
support execution on multiple GPUs. In this paper, we describe the porting of
iPIC3D particle mover to GPUs and the optimization steps to increase the
performance and parallel scaling on multiple GPUs. We analyze the strong
scaling of the mover on two GPU clusters and evaluate its performance and
acceleration. The optimized GPU version which uses pinned memory and
asynchronous data prefetching outperform their corresponding CPU versions by
5-10x on two different systems equipped with NVIDIA K80 and V100 GPUs.Comment: Accepted for publication in ICCS 201
A note on the O(n)-storage implementation of the GKO algorithm
We propose a new O(n)-space implementation of the GKO-Cauchy algorithm for
the solution of linear systems with Cauchy-like matrix. Despite its slightly
higher computational cost, this new algorithm makes a more efficient use of the
processor cache memory. Thus, for matrices of size larger than about 500-1000,
it outperforms the existing algorithms.
We present an applicative case of Cauchy-like matrices with
non-reconstructible main diagonal. In this special instance, the O(n) space
algorithms can be adapted nicely to provide an efficient implementation of
basic linear algebra operations in terms of the low displacement-rank
generators
Electroweak Working Group Report
The report summarizes the results of the activities of the Working Group on
Precision Calculations for the Z Resonance at CERN during 1994.Comment: 158 Latex, including 37 figures (27 eps figures), 30 tables; The
report is part of: D. Bardin, W. Hollik, G. Passarino (eds.), Reports of the
working group on precision calculations for the Z resonance, CERN 95-03 (31
March 1995
Solar BaII 4554 A line as Doppler diagnostics: NLTE analysis in 3D hydrodynamical model
The aim of this paper is to analyse the validity of the Dopplergram and
lambda-meter techniques for the Doppler diagnostics of solar photospheric
velocities using the BaII 4554 A line. Both techniques are evaluated by means
of NLTE radiative transfer calculations of the BaII 4554 A line in a
three-dimensional hydrodynamical model of solar convection. We consider the
cases of spatially unsmeared profiles and the profiles smeared to the
resolution of ground-based observations. We find that: (i)
Speckle-reconstructed Dopplergram velocities reproduce the ``true'' velocities
well at heights around 300 km, except for intergranular lanes with strong
downflows where the velocity can be overestimated. (ii) The lambda-meter
velocities give a good representation of the ``true'' velocities through the
whole photosphere, both under the original and reduced spatial resolutions. The
velocities derived from the inner wing of smeared BaII 4554 A line profiles are
more reliable than those for the outer wing. Only under high spatial resolution
does the inner wing velocities calculated in intergranular regions give an
underestimate (or even a sign reversal) compared with the model velocities.
(iii) NLTE effects should be taken into account in modelling the BaII 4554 A
line profiles. Such effects are more pronounced in intergranular regions. Our
analysis supports the opinion that the Dopplergram technique applied to the
BaII 4554 A line is a valuable tool for the Doppler diagnostics of the middle
photosphere around 300 km. The \lambda-meter technique applied to this line
gives us a good opportunity to ``trace'' the non-thermal motions along the
whole photosphere up to the temperature minimum and lower chromosphere.Comment: accepted by Astronomy and Astrophysic
Four-Fermion Production in Electron-Positron Collisions
This report summarises the results of the four-fermion working group of the
LEP2-MC workshop, held at CERN from 1999 to 2000. Recent developments in the
calculation of four-fermion processes in electron-positron collisions at LEP-2
centre-of-mass energies are presented, concentrating on predictions for four
main reactions: W-pair production, visible photons in four-fermion events,
single-W production and Z-pair production. Based on a comparison of results
derived within different approaches, theoretical uncertainties on these
predictions are established.Comment: 150 pages, 73 figures, 45 table
Determination of the muon charge sign with the dipolar spectrometers of the OPERA experiment
The OPERA long-baseline neutrino-oscillation experiment has observed the
direct appearance of in the CNGS beam. Two large muon
magnetic spectrometers are used to identify muons produced in the
leptonic decay and in interactions by measuring their charge and
momentum. Besides the kinematic analysis of the decays, background
resulting from the decay of charmed particles produced in
interactions is reduced by efficiently identifying the muon track. A new method
for the charge sign determination has been applied, via a weighted angular
matching of the straight track-segments reconstructed in the different parts of
the dipole magnets. Results obtained for Monte Carlo and real data are
presented. Comparison with a method where no matching is used shows a
significant reduction of up to 40\% of the fraction of wrongly determined
charges.Comment: 10 pages. Improvements in the tex
Procedure for short-lived particle detection in the OPERA experiment and its application to charm decays
The OPERA experiment, designed to perform the first observation of oscillations in appearance mode through the detection of
the leptons produced in charged current interactions, has
collected data from 2008 to 2012. In the present paper, the procedure developed
to detect particle decays, occurring over distances of the order of 1 mm
from the neutrino interaction point, is described in detail. The results of its
application to the search for charmed hadrons are then presented as a
validation of the methods for appearance detection
- …