169 research outputs found

    Hybrid-parallel sparse matrix-vector multiplication with explicit communication overlap on current multicore-based systems

    Full text link
    We evaluate optimized parallel sparse matrix-vector operations for several representative application areas on widespread multicore-based cluster configurations. First the single-socket baseline performance is analyzed and modeled with respect to basic architectural properties of standard multicore chips. Beyond the single node, the performance of parallel sparse matrix-vector operations is often limited by communication overhead. Starting from the observation that nonblocking MPI is not able to hide communication cost using standard MPI implementations, we demonstrate that explicit overlap of communication and computation can be achieved by using a dedicated communication thread, which may run on a virtual core. Moreover we identify performance benefits of hybrid MPI/OpenMP programming due to improved load balancing even without explicit communication overlap. We compare performance results for pure MPI, the widely used "vector-like" hybrid programming strategies, and explicit overlap on a modern multicore-based cluster and a Cray XE6 system.Comment: 16 pages, 10 figure

    Benchmarking computer platforms for lattice QCD applications

    Full text link
    We define a benchmark suite for lattice QCD and report on benchmark results from several computer platforms. The platforms considered are apeNEXT, CRAY T3E, Hitachi SR8000, IBM p690, PC-Clusters, and QCDOC.Comment: 3 pages, Lattice03, machines and algorithm

    A Scheme to Numerically Evolve Data for the Conformal Einstein Equation

    Get PDF
    This is the second paper in a series describing a numerical implementation of the conformal Einstein equation. This paper deals with the technical details of the numerical code used to perform numerical time evolutions from a "minimal" set of data. We outline the numerical construction of a complete set of data for our equations from a minimal set of data. The second and the fourth order discretisations, which are used for the construction of the complete data set and for the numerical integration of the time evolution equations, are described and their efficiencies are compared. By using the fourth order scheme we reduce our computer resource requirements --- with respect to memory as well as computation time --- by at least two orders of magnitude as compared to the second order scheme.Comment: 20 pages, 12 figure

    Intraoperative radiotherapy during awake craniotomies: preliminary results of a single-center case series

    Get PDF
    Awake craniotomies are performed to avoid postoperative neurological deficits when resecting lesions in the eloquent cortex, especially the speech area. Intraoperative radiotherapy (IORT) has recently focused on optimizing the oncological treatment of primary malignant brain tumors and metastases. Herein, for the first time, we present preliminary results of IORT in the setting of awake craniotomies. From 2021 to 2022, all patients undergoing awake craniotomies for tumor resection combined with IORT were analyzed retrospectively. Demographical and clinical data, operative procedure, and treatment-related complications were evaluated. Five patients were identified (age (mean ± standard deviation (SD): 65 ± 13.5 years (y)). A solid left frontal metastasis was detected in the first patient (female, 49 y). The second patient (male, 72 y) presented with a solid metastasis on the left parietal lobe. The third patient (male, 52 y) was diagnosed with a left temporoparietal metastasis. Patient four (male, 74 y) was diagnosed with a high-grade glioma on the left frontal lobe. A metastasis on the left temporooccipital lobe was detected in the fifth patient (male, 78 y). After awake craniotomy and macroscopic complete tumor resection, intraoperative tumor bed irradiation was carried out with 50 kV x-rays and a total of 20 Gy for 16.7 ± 2.5 min. During a mean follow-up of 6.3 ± 2.6 months, none of the patients developed any surgery- or IORT-related complications or disabling permanent neurological deficits. Intraoperative radiotherapy in combination with awake craniotomy seems to be feasible and safe

    Kinematics of Multigrid Monte Carlo

    Full text link
    We study the kinematics of multigrid Monte Carlo algorithms by means of acceptance rates for nonlocal Metropolis update proposals. An approximation formula for acceptance rates is derived. We present a comparison of different coarse-to-fine interpolation schemes in free field theory, where the formula is exact. The predictions of the approximation formula for several interacting models are well confirmed by Monte Carlo simulations. The following rule is found: For a critical model with fundamental Hamiltonian H(phi), absence of critical slowing down can only be expected if the expansion of in terms of the shift psi contains no relevant (mass) term. We also introduce a multigrid update procedure for nonabelian lattice gauge theory and study the acceptance rates for gauge group SU(2) in four dimensions.Comment: 28 pages, 8 ps-figures, DESY 92-09

    Dynamical Wilson fermions and the problem of the chiral limit in compact lattice QED

    Get PDF
    We compare the approach to the chiral transition line ~\kappa_c(\bt)~ in quenched and full compact lattice QED with Wilson fermions within the confinement phase, especially in the pseudoscalar sector of the theory. We show that in the strong coupling limit (β=0\beta =0) the quenched theory is a good approximation to the full one, in contrast to the case of β=0.8\beta =0.8. At the larger β\beta-value the transition in the full theory is inconsistent with the zero--mass limit of the pseudoscalar particle, thus prohibiting the definition of a chiral limit.Comment: 13 pages LaTeX (epsf), all figures include

    Dynamics of Monopoles and Flux Tubes in Two-Flavor Dynamical QCD

    Get PDF
    We investigate the confining properties of the QCD vacuum with Nf=2N_f=2 flavors of dynamical quarks, and compare the results with the properties of the quenched theory. We use non-perturbatively O(a)\mathcal{O}(a) improved Wilson fermions to keep cut-off effects small. We focus on color magnetic monopoles. Among the quantities we study are the monopole density and the monopole screening length, the static potential and the profile of the color electric flux tube. We furthermore derive the low-energy effective monopole action. Marked differences between the quenched and dynamical vacuum are found.Comment: 34 pages, 28 figures, Late

    A numerical reinvestigation of the Aoki phase with N_f=2 Wilson fermions at zero temperature

    Get PDF
    We report on a numerical reinvestigation of the Aoki phase in lattice QCD with two flavors of Wilson fermions where the parity-flavor symmetry is spontaneously broken. For this purpose an explicitly symmetry-breaking source term hψˉiγ5τ3ψh\bar{\psi} i \gamma_{5} \tau^{3}\psi was added to the fermion action. The order parameter was computed with the Hybrid Monte Carlo algorithm at several values of (β,κ,h)(\beta,\kappa,h) on lattices of sizes 444^4 to 12412^4 and extrapolated to h=0h=0. The existence of a parity-flavor breaking phase can be confirmed at β=4.0\beta=4.0 and 4.3, while we do not find parity-flavor breaking at β=4.6\beta=4.6 and 5.0.Comment: 8 pages, 5 figures, Revised version as to be published in Phys.Rev.
    • …
    corecore