2,487 research outputs found

    Optimising Sparse Matrix Vector multiplication for large scale FEM problems on FPGA

    Get PDF
    Sparse Matrix Vector multiplication (SpMV) is an important kernel in many scientific applications. In this work we propose an architecture and an automated customisation method to detect and optimise the architecture for block diagonal sparse matrices. We evaluate the proposed approach in the context of the spectral/hp Finite Element Method, using the local matrix assembly approach. This problem leads to a large sparse system of linear equations with block diagonal matrix which is typically solved using an iterative method such as the Preconditioned Conjugate Gradient. The efficiency of the proposed architecture combined with the effectiveness of the proposed customisation method reduces BRAM resource utilisation by as much as 10 times, while achieving identical throughput with existing state of the art designs and requiring minimal development effort from the end user. In the context of the Finite Element Method, our approach enables the solution of larger problems than previously possible, enabling the applicability of FPGAs to more interesting HPC problems

    An efficient sparse conjugate gradient solver using a Beneš permutation network

    Get PDF
    © 2014 Technical University of Munich (TUM).The conjugate gradient (CG) is one of the most widely used iterative methods for solving systems of linear equations. However, parallelizing CG for large sparse systems is difficult due to the inherent irregularity in memory access pattern. We propose a novel processor architecture for the sparse conjugate gradient method. The architecture consists of multiple processing elements and memory banks, and is able to compute efficiently both sparse matrix-vector multiplication, and other dense vector operations. A Beneš permutation network with an optimised control scheme is introduced to reduce memory bank conflicts without expensive logic. We describe a heuristics for offline scheduling, the effect of which is captured in a parametric model for estimating the performance of designs generated from our approach

    CASK - Open-source custom architectures for sparse kernels

    No full text
    © 2016 ACM.Sparse matrix vector multiplication (SpMV) is an impor- tant kernel in many scientific applications. To improve the performance and applicability of FPGA based SpMV, we propose an approach for exploiting properties of the input matrix to generate optimised custom architectures. The ar- chitectures generated by our approach are between 3.8 to 48 times faster than the worst case architectures for each matrix, showing the benefits of instance specific design for SpMV

    Run-time reconfigurable acceleration for genetic programming fitness evaluation in trading strategies

    Get PDF
    Genetic programming can be used to identify complex patterns in financial markets which may lead to more advanced trading strategies. However, the computationally intensive nature of genetic programming makes it difficult to apply to real world problems, particularly in real-time constrained scenarios. In this work we propose the use of Field Programmable Gate Array technology to accelerate the fitness evaluation step, one of the most computationally demanding operations in genetic programming. We propose to develop a fully-pipelined, mixed precision design using run-time reconfiguration to accelerate fitness evaluation. We show that run-time reconfiguration can reduce resource consumption by a factor of 2 compared to previous solutions on certain configurations. The proposed design is up to 22 times faster than an optimised, multithreaded software implementation while achieving comparable financial returns

    Point prevalence of surgical checklist use in Europe: relationship with hospital mortality

    Get PDF
    Background The prevalence of use of the World Health Organization surgical checklist is unknown. The clinical effectiveness of this intervention in improving postoperative outcomes is debated. Methods We undertook a retrospective analysis of data describing surgical checklist use from a 7 day cohort study of surgical outcomes in 28 European nations (European Surgical Outcomes Study, EuSOS). The analysis included hospitals recruiting >10 patients and excluding outlier hospitals above the 95th centile for mortality. Multivariate logistic regression and three-level hierarchical generalized mixed models were constructed to explore the relationship between surgical checklist use and hospital mortality. Findings are presented as crude and adjusted odds ratios (ORs) with 95% confidence intervals (CIs). Results A total of 45 591 patients from 426 hospitals were included in the analysis. A surgical checklist was used in 67.5% patients, with marked variation across countries (0-99.6% of patients). Surgical checklist exposure was associated with lower crude hospital mortality (OR 0.84, CI 0.75-0.94; P=0.002). This effect remained after adjustment for baseline risk factors in a multivariate model (adjusted OR 0.81, CI 0.70-0.94; P<0.005) and strengthened after adjusting for variations within countries and hospitals in a three-level generalized mixed model (adjusted OR 0.71, CI 0.58-0.85; P<0.001). Conclusions The use of surgical checklists varies across European nations. Reported use of a checklist was associated with lower mortality. This observation may represent a protective effect of the surgical checklist itself, or alternatively, may be an indirect indicator of the quality of perioperative care. Clinical trial registration The European Surgical Outcomes Study is registered with ClinicalTrials.gov, number NCT0120360

    Multiplicity dependence of jet-like two-particle correlations in p-Pb collisions at sNN\sqrt{s_{NN}} = 5.02 TeV

    Full text link
    Two-particle angular correlations between unidentified charged trigger and associated particles are measured by the ALICE detector in p-Pb collisions at a nucleon-nucleon centre-of-mass energy of 5.02 TeV. The transverse-momentum range 0.7 <pT,assoc<pT,trig< < p_{\rm{T}, assoc} < p_{\rm{T}, trig} < 5.0 GeV/cc is examined, to include correlations induced by jets originating from low momen\-tum-transfer scatterings (minijets). The correlations expressed as associated yield per trigger particle are obtained in the pseudorapidity range η<0.9|\eta|<0.9. The near-side long-range pseudorapidity correlations observed in high-multiplicity p-Pb collisions are subtracted from both near-side short-range and away-side correlations in order to remove the non-jet-like components. The yields in the jet-like peaks are found to be invariant with event multiplicity with the exception of events with low multiplicity. This invariance is consistent with the particles being produced via the incoherent fragmentation of multiple parton--parton scatterings, while the yield related to the previously observed ridge structures is not jet-related. The number of uncorrelated sources of particle production is found to increase linearly with multiplicity, suggesting no saturation of the number of multi-parton interactions even in the highest multiplicity p-Pb collisions. Further, the number scales in the intermediate multiplicity region with the number of binary nucleon-nucleon collisions estimated with a Glauber Monte-Carlo simulation.Comment: 23 pages, 6 captioned figures, 1 table, authors from page 17, published version, figures at http://aliceinfo.cern.ch/ArtSubmission/node/161

    Charge separation relative to the reaction plane in Pb-Pb collisions at sNN=2.76\sqrt{s_{\rm NN}}= 2.76 TeV

    Get PDF
    Measurements of charge dependent azimuthal correlations with the ALICE detector at the LHC are reported for Pb-Pb collisions at sNN=2.76\sqrt{s_{\rm NN}} = 2.76 TeV. Two- and three-particle charge-dependent azimuthal correlations in the pseudo-rapidity range η<0.8|\eta| < 0.8 are presented as a function of the collision centrality, particle separation in pseudo-rapidity, and transverse momentum. A clear signal compatible with a charge-dependent separation relative to the reaction plane is observed, which shows little or no collision energy dependence when compared to measurements at RHIC energies. This provides a new insight for understanding the nature of the charge dependent azimuthal correlations observed at RHIC and LHC energies.Comment: 12 pages, 3 captioned figures, authors from page 2 to 6, published version, figures at http://aliceinfo.cern.ch/ArtSubmission/node/286

    Multi-particle azimuthal correlations in p-Pb and Pb-Pb collisions at the CERN Large Hadron Collider

    Full text link
    Measurements of multi-particle azimuthal correlations (cumulants) for charged particles in p-Pb and Pb-Pb collisions are presented. They help address the question of whether there is evidence for global, flow-like, azimuthal correlations in the p-Pb system. Comparisons are made to measurements from the larger Pb-Pb system, where such evidence is established. In particular, the second harmonic two-particle cumulants are found to decrease with multiplicity, characteristic of a dominance of few-particle correlations in p-Pb collisions. However, when a Δη|\Delta \eta| gap is placed to suppress such correlations, the two-particle cumulants begin to rise at high-multiplicity, indicating the presence of global azimuthal correlations. The Pb-Pb values are higher than the p-Pb values at similar multiplicities. In both systems, the second harmonic four-particle cumulants exhibit a transition from positive to negative values when the multiplicity increases. The negative values allow for a measurement of v2{4}v_{2}\{4\} to be made, which is found to be higher in Pb-Pb collisions at similar multiplicities. The second harmonic six-particle cumulants are also found to be higher in Pb-Pb collisions. In Pb-Pb collisions, we generally find v2{4}v2{6}0v_{2}\{4\} \simeq v_{2}\{6\}\neq 0 which is indicative of a Bessel-Gaussian function for the v2v_{2} distribution. For very high-multiplicity Pb-Pb collisions, we observe that the four- and six-particle cumulants become consistent with 0. Finally, third harmonic two-particle cumulants in p-Pb and Pb-Pb are measured. These are found to be similar for overlapping multiplicities, when a Δη>1.4|\Delta\eta| > 1.4 gap is placed.Comment: 25 pages, 11 captioned figures, 3 tables, authors from page 20, published version, figures at http://aliceinfo.cern.ch/ArtSubmission/node/87

    Anisotropic flow of charged hadrons, pions and (anti-)protons measured at high transverse momentum in Pb-Pb collisions at sNN=2.76\sqrt{s_{\rm NN}}=2.76 TeV

    Get PDF
    The elliptic, v2v_2, triangular, v3v_3, and quadrangular, v4v_4, azimuthal anisotropic flow coefficients are measured for unidentified charged particles, pions and (anti-)protons in Pb-Pb collisions at sNN=2.76\sqrt{s_{\rm NN}} = 2.76 TeV with the ALICE detector at the Large Hadron Collider. Results obtained with the event plane and four-particle cumulant methods are reported for the pseudo-rapidity range η<0.8|\eta|<0.8 at different collision centralities and as a function of transverse momentum, pTp_{\rm T}, out to pT=20p_{\rm T}=20 GeV/cc. The observed non-zero elliptic and triangular flow depends only weakly on transverse momentum for pT>8p_{\rm T}>8 GeV/cc. The small pTp_{\rm T} dependence of the difference between elliptic flow results obtained from the event plane and four-particle cumulant methods suggests a common origin of flow fluctuations up to pT=8p_{\rm T}=8 GeV/cc. The magnitude of the (anti-)proton elliptic and triangular flow is larger than that of pions out to at least pT=8p_{\rm T}=8 GeV/cc indicating that the particle type dependence persists out to high pTp_{\rm T}.Comment: 16 pages, 5 captioned figures, authors from page 11, published version, figures at http://aliceinfo.cern.ch/ArtSubmission/node/186

    Centrality dependence of charged particle production at large transverse momentum in Pb-Pb collisions at sNN=2.76\sqrt{s_{\rm{NN}}} = 2.76 TeV

    Get PDF
    The inclusive transverse momentum (pTp_{\rm T}) distributions of primary charged particles are measured in the pseudo-rapidity range η<0.8|\eta|<0.8 as a function of event centrality in Pb-Pb collisions at sNN=2.76\sqrt{s_{\rm{NN}}}=2.76 TeV with ALICE at the LHC. The data are presented in the pTp_{\rm T} range 0.15<pT<500.15<p_{\rm T}<50 GeV/cc for nine centrality intervals from 70-80% to 0-5%. The Pb-Pb spectra are presented in terms of the nuclear modification factor RAAR_{\rm{AA}} using a pp reference spectrum measured at the same collision energy. We observe that the suppression of high-pTp_{\rm T} particles strongly depends on event centrality. In central collisions (0-5%) the yield is most suppressed with RAA0.13R_{\rm{AA}}\approx0.13 at pT=6p_{\rm T}=6-7 GeV/cc. Above pT=7p_{\rm T}=7 GeV/cc, there is a significant rise in the nuclear modification factor, which reaches RAA0.4R_{\rm{AA}} \approx0.4 for pT>30p_{\rm T}>30 GeV/cc. In peripheral collisions (70-80%), the suppression is weaker with RAA0.7R_{\rm{AA}} \approx 0.7 almost independently of pTp_{\rm T}. The measured nuclear modification factors are compared to other measurements and model calculations.Comment: 17 pages, 4 captioned figures, 2 tables, authors from page 12, published version, figures at http://aliceinfo.cern.ch/ArtSubmission/node/284
    corecore