603 research outputs found

    Insights into the Fallback Path of Best-Effort Hardware Transactional Memory Systems

    Get PDF
    DOI 10.1007/978-3-319-43659-3Current industry proposals for Hardware Transactional Memory (HTM) focus on best-effort solutions (BE-HTM) where hardware limits are imposed on transactions. These designs may show a significant performance degradation due to high contention scenarios and different hardware and operating system limitations that abort transactions, e.g. cache overflows, hardware and software exceptions, etc. To deal with these events and to ensure forward progress, BE-HTM systems usually provide a software fallback path to execute a lock-based version of the code. In this paper, we propose a hardware implementation of an irrevocability mechanism as an alternative to the software fallback path to gain insight into the hardware improvements that could enhance the execution of such a fallback. Our mechanism anticipates the abort that causes the transaction serialization, and stalls other transactions in the system so that transactional work loss is mini- mized. In addition, we evaluate the main software fallback path approaches and propose the use of ticket locks that hold precise information of the number of transactions waiting to enter the fallback. Thus, the separation of transactional and fallback execution can be achieved in a precise manner. The evaluation is carried out using the Simics/GEMS simulator and the complete range of STAMP transactional suite benchmarks. We obtain significant performance benefits of around twice the speedup and an abort reduction of 50% over the software fallback path for a number of benchmarks.Universidad de Málaga. Campus de Excelencia Internacional Andalucía Tech

    Admit your weakness: Verifying correctness on TSO architectures

    Get PDF
    “The final publication is available at http://link.springer.com/chapter/10.1007%2F978-3-319-15317-9_22 ”.Linearizability has become the standard correctness criterion for fine-grained non-atomic concurrent algorithms, however, most approaches assume a sequentially consistent memory model, which is not always realised in practice. In this paper we study the correctness of concurrent algorithms on a weak memory model: the TSO (Total Store Order) memory model, which is commonly implemented by multicore architectures. Here, linearizability is often too strict, and hence, we prove a weaker criterion, quiescent consistency instead. Like linearizability, quiescent consistency is compositional making it an ideal correctness criterion in a component-based context. We demonstrate how to model a typical concurrent algorithm, seqlock, and prove it quiescent consistent using a simulation-based approach. Previous approaches to proving correctness on TSO architectures have been based on linearizabilty which makes it necessary to modify the algorithm’s high-level requirements. Our approach is the first, to our knowledge, for proving correctness without the need for such a modification

    A wide-spectrum language for verification of programs on weak memory models

    Full text link
    Modern processors deploy a variety of weak memory models, which for efficiency reasons may (appear to) execute instructions in an order different to that specified by the program text. The consequences of instruction reordering can be complex and subtle, and can impact on ensuring correctness. Previous work on the semantics of weak memory models has focussed on the behaviour of assembler-level programs. In this paper we utilise that work to extract some general principles underlying instruction reordering, and apply those principles to a wide-spectrum language encompassing abstract data types as well as low-level assembler code. The goal is to support reasoning about implementations of data structures for modern processors with respect to an abstract specification. Specifically, we define an operational semantics, from which we derive some properties of program refinement, and encode the semantics in the rewriting engine Maude as a model-checking tool. The tool is used to validate the semantics against the behaviour of a set of litmus tests (small assembler programs) run on hardware, and also to model check implementations of data structures from the literature against their abstract specifications

    Evaluating Molecular Mechanical Potentials for Helical Peptides and Proteins

    Get PDF
    Multiple variants of the AMBER all-atom force field were quantitatively evaluated with respect to their ability to accurately characterize helix-coil equilibria in explicit solvent simulations. Using a global distributed computing network, absolute conformational convergence was achieved for large ensembles of the capped A21 and Fs helical peptides. Further assessment of these AMBER variants was conducted via simulations of a flexible 164-residue five-helix-bundle protein, apolipophorin-III, on the 100 ns timescale. Of the contemporary potentials that had not been assessed previously, the AMBER-99SB force field showed significant helix-destabilizing tendencies, with beta bridge formation occurring in helical peptides, and unfolding of apolipophorin-III occurring on the tens of nanoseconds timescale. The AMBER-03 force field, while showing adequate helical propensities for both peptides and stabilizing apolipophorin-III, (i) predicts an unexpected decrease in helicity with ALA→ARG+ substitution, (ii) lacks experimentally observed 310 helical content, and (iii) deviates strongly from average apolipophorin-III NMR structural properties. As is observed for AMBER-99SB, AMBER-03 significantly overweighs the contribution of extended and polyproline backbone configurations to the conformational equilibrium. In contrast, the AMBER-99φ force field, which was previously shown to best reproduce experimental measurements of the helix-coil transition in model helical peptides, adequately stabilizes apolipophorin-III and yields both an average gyration radius and polar solvent exposed surface area that are in excellent agreement with the NMR ensemble

    Studying the Underlying Event in Drell-Yan and High Transverse Momentum Jet Production at the Tevatron

    Get PDF
    We study the underlying event in proton-antiproton collisions by examining the behavior of charged particles (transverse momentum pT > 0.5 GeV/c, pseudorapidity |\eta| < 1) produced in association with large transverse momentum jets (~2.2 fb-1) or with Drell-Yan lepton-pairs (~2.7 fb-1) in the Z-boson mass region (70 < M(pair) < 110 GeV/c2) as measured by CDF at 1.96 TeV center-of-mass energy. We use the direction of the lepton-pair (in Drell-Yan production) or the leading jet (in high-pT jet production) in each event to define three regions of \eta-\phi space; toward, away, and transverse, where \phi is the azimuthal scattering angle. For Drell-Yan production (excluding the leptons) both the toward and transverse regions are very sensitive to the underlying event. In high-pT jet production the transverse region is very sensitive to the underlying event and is separated into a MAX and MIN transverse region, which helps separate the hard component (initial and final-state radiation) from the beam-beam remnant and multiple parton interaction components of the scattering. The data are corrected to the particle level to remove detector effects and are then compared with several QCD Monte-Carlo models. The goal of this analysis is to provide data that can be used to test and improve the QCD Monte-Carlo models of the underlying event that are used to simulate hadron-hadron collisions.Comment: Submitted to Phys.Rev.

    Forward-Backward Asymmetry in Top Quark Production in ppbar Collisions at sqrt{s}=1.96 TeV

    Get PDF
    Reconstructable final state kinematics and charge assignment in the reaction ppbar->ttbar allows tests of discrete strong interaction symmetries at high energy. We define frame dependent forward-backward asymmetries for the outgoing top quark in both the ppbar and ttbar rest frames, correct for experimental distortions, and derive values at the parton-level. Using 1.9/fb of ppbar collisions at sqrt{s}=1.96 TeV recorded with the CDF II detector at the Fermilab Tevatron, we measure forward-backward top quark production asymmetries in the ppbar and ttbar rest frames of A_{FB,pp} = 0.17 +- 0.08 and A_{FB,tt} = 0.24 +- 0.14.Comment: 7 pages, 2 figures, submitted to Phys.Rev.Lett, corrected references and change of tex

    Measurement of the Forward-Backward Asymmetry in the B -> K(*) mu+ mu- Decay and First Observation of the Bs -> phi mu+ mu- Decay

    Get PDF
    We reconstruct the rare decays B+K+μ+μB^+ \to K^+\mu^+\mu^-, B0K(892)0μ+μB^0 \to K^{*}(892)^0\mu^+\mu^-, and Bs0ϕ(1020)μ+μB^0_s \to \phi(1020)\mu^+\mu^- in a data sample corresponding to 4.4fb14.4 {\rm fb^{-1}} collected in ppˉp\bar{p} collisions at s=1.96TeV\sqrt{s}=1.96 {\rm TeV} by the CDF II detector at the Fermilab Tevatron Collider. Using 121±16121 \pm 16 B+K+μ+μB^+ \to K^+\mu^+\mu^- and 101±12101 \pm 12 B0K0μ+μB^0 \to K^{*0}\mu^+\mu^- decays we report the branching ratios. In addition, we report the measurement of the differential branching ratio and the muon forward-backward asymmetry in the B+B^+ and B0B^0 decay modes, and the K0K^{*0} longitudinal polarization in the B0B^0 decay mode with respect to the squared dimuon mass. These are consistent with the theoretical prediction from the standard model, and most recent determinations from other experiments and of comparable accuracy. We also report the first observation of the Bs0ϕμ+μdecayandmeasureitsbranchingratioB^0_s \to \phi\mu^+\mu^- decay and measure its branching ratio {\mathcal{B}}(B^0_s \to \phi\mu^+\mu^-) = [1.44 \pm 0.33 \pm 0.46] \times 10^{-6}using using 27 \pm 6signalevents.Thisiscurrentlythemostrare signal events. This is currently the most rare B^0_s$ decay observed.Comment: 7 pages, 2 figures, 3 tables. Submitted to Phys. Rev. Let

    Measurement of the W+WW^+W^- Production Cross Section and Search for Anomalous WWγWW\gamma and WWZWWZ Couplings in ppˉp \bar p Collisions at s=1.96\sqrt{s} = 1.96 TeV

    Get PDF
    This Letter describes the current most precise measurement of the WW boson pair production cross section and most sensitive test of anomalous WWγWW\gamma and WWZWWZ couplings in ppˉp \bar p collisions at a center-of-mass energy of 1.96 TeV. The WWWW candidates are reconstructed from decays containing two charged leptons and two neutrinos, where the charged leptons are either electrons or muons. Using data collected by the CDF II detector from 3.6 fb1^{-1} of integrated luminosity, a total of 654 candidate events are observed with an expected background contribution of 320±47320 \pm 47 events. The measured total cross section is σ(ppˉW+W+X)=12.1±0.9(stat)1.4+1.6(syst)\sigma (p \bar p \to W^+ W^- + X) = 12.1 \pm 0.9 \textrm{(stat)} ^{+1.6}_{-1.4} \textrm{(syst)} pb, which is in good agreement with the standard model prediction. The same data sample is used to place constraints on anomalous WWγWW\gamma and WWZWWZ couplings.Comment: submitted to Phys. Rev. Let

    Observation of the Baryonic Flavor-Changing Neutral Current Decay Lambda_b -> Lambda mu+ mu-

    Get PDF
    We report the first observation of the baryonic flavor-changing neutral current decay Lambda_b -> Lambda mu+ mu- with 24 signal events and a statistical significance of 5.8 Gaussian standard deviations. This measurement uses ppbar collisions data sample corresponding to 6.8fb-1 at sqrt{s}=1.96TeV collected by the CDF II detector at the Tevatron collider. The total and differential branching ratios for Lambda_b -> Lambda mu+ mu- are measured. We find B(Lambda_b -> Lambda mu+ mu-) = [1.73+-0.42(stat)+-0.55(syst)] x 10^{-6}. We also report the first measurement of the differential branching ratio of B_s -> phi mu+ mu- using 49 signal events. In addition, we report branching ratios for B+ -> K+ mu+ mu-, B0 -> K0 mu+ mu-, and B -> K*(892) mu+ mu- decays.Comment: 8 pages, 2 figures, 4 tables. Submitted to Phys. Rev. Let
    corecore