Hybrid-parallel sparse matrix-vector multiplication with explicit communication overlap on current multicore-based systems
We evaluate optimized parallel sparse matrix-vector operations for several
representative application areas on widespread multicore-based cluster
configurations. First the single-socket baseline performance is analyzed and
modeled with respect to basic architectural properties of standard multicore
chips. Beyond the single node, the performance of parallel sparse matrix-vector
operations is often limited by communication overhead. Starting from the
observation that nonblocking MPI is not able to hide communication cost using
standard MPI implementations, we demonstrate that explicit overlap of
communication and computation can be achieved by using a dedicated
communication thread, which may run on a virtual core. Moreover we identify
performance benefits of hybrid MPI/OpenMP programming due to improved load
balancing even without explicit communication overlap. We compare performance
results for pure MPI, the widely used "vector-like" hybrid programming
strategies, and explicit overlap on a modern multicore-based cluster and a Cray
XE6 system. Comment: 16 pages, 10 figures
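As a rough illustration of the overlap idea in this abstract, the sketch below runs a stand-in "communication" step on a dedicated thread while the main thread multiplies the part of a CSR matrix that needs only locally owned vector entries. It is not the authors' code: `spmv_csr`, `overlapped_spmv` and `fetch_halo` are hypothetical names, `fetch_halo` stands in for an MPI halo exchange, and the column layout (local columns first, remote columns after) is an assumption. In CPython the thread overlaps with computation only while `fetch_halo` blocks on I/O.

```python
import threading


def spmv_csr(row_ptr, col_idx, vals, x, y, use_column):
    """Accumulate y += A @ x, restricted to columns j where use_column(j) is true."""
    for i in range(len(row_ptr) - 1):
        acc = 0.0
        for k in range(row_ptr[i], row_ptr[i + 1]):
            j = col_idx[k]
            if use_column(j):
                acc += vals[k] * x[j]
        y[i] += acc


def overlapped_spmv(row_ptr, col_idx, vals, x_local, fetch_halo):
    """Sparse matrix-vector product with the halo fetch on a dedicated thread.

    Assumption: columns 0..n_local-1 refer to locally owned entries of x,
    and columns >= n_local refer to remote ("halo") entries.
    """
    n_local = len(x_local)
    received = {}

    def communicate():
        # Hypothetical stand-in for the MPI exchange of remote vector entries.
        received["halo"] = list(fetch_halo())

    comm_thread = threading.Thread(target=communicate)
    comm_thread.start()

    # Local part of the product proceeds while the communication thread runs.
    y = [0.0] * (len(row_ptr) - 1)
    spmv_csr(row_ptr, col_idx, vals, x_local, y, lambda j: j < n_local)

    # Wait for the remote entries, then add the nonlocal contributions.
    comm_thread.join()
    x_full = list(x_local) + received["halo"]
    spmv_csr(row_ptr, col_idx, vals, x_full, y, lambda j: j >= n_local)
    return y


# Usage: A = [[1, 2, 0], [0, 3, 4]] with two local columns and one halo column.
row_ptr = [0, 2, 4]
col_idx = [0, 1, 1, 2]
vals = [1.0, 2.0, 3.0, 4.0]
result = overlapped_spmv(row_ptr, col_idx, vals, [1.0, 1.0], lambda: [2.0])
```

Splitting the product into a local and a nonlocal pass is what makes the overlap possible: the local pass never touches entries that are still in flight.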
Benchmarking computer platforms for lattice QCD applications
We define a benchmark suite for lattice QCD and report on benchmark results
from several computer platforms. The platforms considered are apeNEXT, CRAY
T3E, Hitachi SR8000, IBM p690, PC-Clusters, and QCDOC. Comment: 3 pages, Lattice03, machines and algorithms
A Scheme to Numerically Evolve Data for the Conformal Einstein Equation
This is the second paper in a series describing a numerical implementation of
the conformal Einstein equation. This paper deals with the technical details of
the numerical code used to perform numerical time evolutions from a "minimal"
set of data.
We outline the numerical construction of a complete set of data for our
equations from a minimal set of data. The second and the fourth order
discretisations, which are used for the construction of the complete data set
and for the numerical integration of the time evolution equations, are
described and their efficiencies are compared. By using the fourth order scheme
we reduce our computer resource requirements --- with respect to memory as well
as computation time --- by at least two orders of magnitude as compared to the
second order scheme. Comment: 20 pages, 12 figures
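The gap between second and fourth order discretisations can be illustrated with the standard central-difference stencils for a first derivative. This is a generic sketch, not the discretisation used in the paper; the function names and the test function sin(x) are assumptions chosen for illustration. Halving the step size should reduce the error by roughly 4 for the second order stencil and by roughly 16 for the fourth order one.

```python
import math


def d1_second_order(f, x, h):
    """Second order central difference for f'(x)."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def d1_fourth_order(f, x, h):
    """Fourth order central difference for f'(x)."""
    return (-f(x + 2 * h) + 8 * f(x + h) - 8 * f(x - h) + f(x - 2 * h)) / (12.0 * h)


def observed_order(stencil, f, exact, x, h):
    """Estimate the convergence order from the errors at step sizes h and h/2."""
    err_coarse = abs(stencil(f, x, h) - exact)
    err_fine = abs(stencil(f, x, h / 2.0) - exact)
    return math.log2(err_coarse / err_fine)


# Usage: differentiate sin at x = 1, where the exact derivative is cos(1).
order2 = observed_order(d1_second_order, math.sin, math.cos(1.0), 1.0, 0.1)
order4 = observed_order(d1_fourth_order, math.sin, math.cos(1.0), 1.0, 0.1)
```

Because the fourth order error falls so much faster with the step size, a given accuracy can be reached on a much coarser grid, which is the kind of saving in memory and computation time the abstract reports.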
Intraoperative radiotherapy during awake craniotomies: preliminary results of a single-center case series
Awake craniotomies are performed to avoid postoperative neurological deficits when resecting lesions in the eloquent cortex, especially the speech area. Intraoperative radiotherapy (IORT) has recently focused on optimizing the oncological treatment of primary malignant brain tumors and metastases. Herein, for the first time, we present preliminary results of IORT in the setting of awake craniotomies. From 2021 to 2022, all patients undergoing awake craniotomies for tumor resection combined with IORT were analyzed retrospectively. Demographic and clinical data, operative procedure, and treatment-related complications were evaluated. Five patients were identified (age, mean ± standard deviation (SD): 65 ± 13.5 years (y)). A solid left frontal metastasis was detected in the first patient (female, 49 y). The second patient (male, 72 y) presented with a solid metastasis in the left parietal lobe. The third patient (male, 52 y) was diagnosed with a left temporoparietal metastasis. Patient four (male, 74 y) was diagnosed with a high-grade glioma in the left frontal lobe. A metastasis in the left temporooccipital lobe was detected in the fifth patient (male, 78 y). After awake craniotomy and macroscopic complete tumor resection, intraoperative tumor bed irradiation was carried out with 50 kV x-rays and a total of 20 Gy for 16.7 ± 2.5 min. During a mean follow-up of 6.3 ± 2.6 months, none of the patients developed any surgery- or IORT-related complications or disabling permanent neurological deficits. Intraoperative radiotherapy in combination with awake craniotomy seems to be feasible and safe.
Kinematics of Multigrid Monte Carlo
We study the kinematics of multigrid Monte Carlo algorithms by means of
acceptance rates for nonlocal Metropolis update proposals. An approximation
formula for acceptance rates is derived. We present a comparison of different
coarse-to-fine interpolation schemes in free field theory, where the formula is
exact. The predictions of the approximation formula for several interacting
models are well confirmed by Monte Carlo simulations. The following rule is
found: For a critical model with fundamental Hamiltonian H(phi), absence of
critical slowing down can only be expected if the expansion of <H(phi+psi)>
in terms of the shift psi contains no relevant (mass) term. We also introduce a
multigrid update procedure for nonabelian lattice gauge theory and study the
acceptance rates for gauge group SU(2) in four dimensions. Comment: 28 pages, 8 ps-figures, DESY 92-09
Dynamical Wilson fermions and the problem of the chiral limit in compact lattice QED
We compare the approach to the chiral transition line \kappa_c(\beta) in
quenched and full compact lattice QED with Wilson fermions within the
confinement phase, especially in the pseudoscalar sector of the theory. We show
that in the strong coupling limit (\beta = 0) the quenched theory is a good
approximation to the full one, in contrast to the case of . At the
larger \beta-value the transition in the full theory is inconsistent with the
zero--mass limit of the pseudoscalar particle, thus prohibiting the definition
of a chiral limit. Comment: 13 pages LaTeX (epsf), all figures included
Dynamics of Monopoles and Flux Tubes in Two-Flavor Dynamical QCD
We investigate the confining properties of the QCD vacuum with N_f = 2
flavors of dynamical quarks, and compare the results with the properties of the
quenched theory. We use non-perturbatively improved Wilson
fermions to keep cut-off effects small. We focus on color magnetic monopoles.
Among the quantities we study are the monopole density and the monopole
screening length, the static potential and the profile of the color electric
flux tube. We furthermore derive the low-energy effective monopole action.
Marked differences between the quenched and dynamical vacuum are found. Comment: 34 pages, 28 figures, LaTeX
A numerical reinvestigation of the Aoki phase with N_f=2 Wilson fermions at zero temperature
We report on a numerical reinvestigation of the Aoki phase in lattice QCD
with two flavors of Wilson fermions where the parity-flavor symmetry is
spontaneously broken. For this purpose an explicitly symmetry-breaking source
term was added to the fermion action.
The order parameter was computed with
the Hybrid Monte Carlo algorithm at several values of on
lattices of sizes to and extrapolated to . The existence of a
parity-flavor breaking phase can be confirmed at and 4.3, while we
do not find parity-flavor breaking at and 5.0. Comment: 8 pages, 5 figures, Revised version as to be published in Phys.Rev.