2,487 research outputs found
Optimising Sparse Matrix Vector multiplication for large scale FEM problems on FPGA
Sparse Matrix Vector multiplication (SpMV) is an important kernel in many scientific applications. In this work we propose an architecture and an automated customisation method to detect and optimise the architecture for block diagonal sparse matrices. We evaluate the proposed approach in the context of the spectral/hp Finite Element Method, using the local matrix assembly approach. This problem leads to a large sparse system of linear equations with block diagonal matrix which is typically solved using an iterative method such as the Preconditioned Conjugate Gradient. The efficiency of the proposed architecture combined with the effectiveness of the proposed customisation method reduces BRAM resource utilisation by as much as 10 times, while achieving identical throughput with existing state of the art designs and requiring minimal development effort from the end user. In the context of the Finite Element Method, our approach enables the solution of larger problems than previously possible, enabling the applicability of FPGAs to more interesting HPC problems
An efficient sparse conjugate gradient solver using a Beneš permutation network
© 2014 Technical University of Munich (TUM).The conjugate gradient (CG) is one of the most widely used iterative methods for solving systems of linear equations. However, parallelizing CG for large sparse systems is difficult due to the inherent irregularity in memory access pattern. We propose a novel processor architecture for the sparse conjugate gradient method. The architecture consists of multiple processing elements and memory banks, and is able to compute efficiently both sparse matrix-vector multiplication, and other dense vector operations. A Beneš permutation network with an optimised control scheme is introduced to reduce memory bank conflicts without expensive logic. We describe a heuristics for offline scheduling, the effect of which is captured in a parametric model for estimating the performance of designs generated from our approach
CASK - Open-source custom architectures for sparse kernels
© 2016 ACM.Sparse matrix vector multiplication (SpMV) is an impor- tant kernel in many scientific applications. To improve the performance and applicability of FPGA based SpMV, we propose an approach for exploiting properties of the input matrix to generate optimised custom architectures. The ar- chitectures generated by our approach are between 3.8 to 48 times faster than the worst case architectures for each matrix, showing the benefits of instance specific design for SpMV
Run-time reconfigurable acceleration for genetic programming fitness evaluation in trading strategies
Genetic programming can be used to identify complex patterns in financial markets which may lead to more advanced trading strategies. However, the computationally intensive nature of genetic programming makes it difficult to apply to real world problems, particularly in real-time constrained scenarios. In this work we propose the use of Field Programmable Gate Array technology to accelerate the fitness evaluation step, one of the most computationally demanding operations in genetic programming. We propose to develop a fully-pipelined, mixed precision design using run-time reconfiguration to accelerate fitness evaluation. We show that run-time reconfiguration can reduce resource consumption by a factor of 2 compared to previous solutions on certain configurations. The proposed design is up to 22 times faster than an optimised, multithreaded software implementation while achieving comparable financial returns
Point prevalence of surgical checklist use in Europe: relationship with hospital mortality
Background The prevalence of use of the World Health Organization surgical checklist is unknown. The clinical effectiveness of this intervention in improving postoperative outcomes is debated. Methods We undertook a retrospective analysis of data describing surgical checklist use from a 7 day cohort study of surgical outcomes in 28 European nations (European Surgical Outcomes Study, EuSOS). The analysis included hospitals recruiting >10 patients and excluding outlier hospitals above the 95th centile for mortality. Multivariate logistic regression and three-level hierarchical generalized mixed models were constructed to explore the relationship between surgical checklist use and hospital mortality. Findings are presented as crude and adjusted odds ratios (ORs) with 95% confidence intervals (CIs). Results A total of 45 591 patients from 426 hospitals were included in the analysis. A surgical checklist was used in 67.5% patients, with marked variation across countries (0-99.6% of patients). Surgical checklist exposure was associated with lower crude hospital mortality (OR 0.84, CI 0.75-0.94; P=0.002). This effect remained after adjustment for baseline risk factors in a multivariate model (adjusted OR 0.81, CI 0.70-0.94; P<0.005) and strengthened after adjusting for variations within countries and hospitals in a three-level generalized mixed model (adjusted OR 0.71, CI 0.58-0.85; P<0.001). Conclusions The use of surgical checklists varies across European nations. Reported use of a checklist was associated with lower mortality. This observation may represent a protective effect of the surgical checklist itself, or alternatively, may be an indirect indicator of the quality of perioperative care. Clinical trial registration The European Surgical Outcomes Study is registered with ClinicalTrials.gov, number NCT0120360
Multiplicity dependence of jet-like two-particle correlations in p-Pb collisions at = 5.02 TeV
Two-particle angular correlations between unidentified charged trigger and
associated particles are measured by the ALICE detector in p-Pb collisions at a
nucleon-nucleon centre-of-mass energy of 5.02 TeV. The transverse-momentum
range 0.7 5.0 GeV/ is examined,
to include correlations induced by jets originating from low
momen\-tum-transfer scatterings (minijets). The correlations expressed as
associated yield per trigger particle are obtained in the pseudorapidity range
. The near-side long-range pseudorapidity correlations observed in
high-multiplicity p-Pb collisions are subtracted from both near-side
short-range and away-side correlations in order to remove the non-jet-like
components. The yields in the jet-like peaks are found to be invariant with
event multiplicity with the exception of events with low multiplicity. This
invariance is consistent with the particles being produced via the incoherent
fragmentation of multiple parton--parton scatterings, while the yield related
to the previously observed ridge structures is not jet-related. The number of
uncorrelated sources of particle production is found to increase linearly with
multiplicity, suggesting no saturation of the number of multi-parton
interactions even in the highest multiplicity p-Pb collisions. Further, the
number scales in the intermediate multiplicity region with the number of binary
nucleon-nucleon collisions estimated with a Glauber Monte-Carlo simulation.Comment: 23 pages, 6 captioned figures, 1 table, authors from page 17,
published version, figures at
http://aliceinfo.cern.ch/ArtSubmission/node/161
Charge separation relative to the reaction plane in Pb-Pb collisions at TeV
Measurements of charge dependent azimuthal correlations with the ALICE
detector at the LHC are reported for Pb-Pb collisions at TeV. Two- and three-particle charge-dependent azimuthal correlations in
the pseudo-rapidity range are presented as a function of the
collision centrality, particle separation in pseudo-rapidity, and transverse
momentum. A clear signal compatible with a charge-dependent separation relative
to the reaction plane is observed, which shows little or no collision energy
dependence when compared to measurements at RHIC energies. This provides a new
insight for understanding the nature of the charge dependent azimuthal
correlations observed at RHIC and LHC energies.Comment: 12 pages, 3 captioned figures, authors from page 2 to 6, published
version, figures at http://aliceinfo.cern.ch/ArtSubmission/node/286
Multi-particle azimuthal correlations in p-Pb and Pb-Pb collisions at the CERN Large Hadron Collider
Measurements of multi-particle azimuthal correlations (cumulants) for charged
particles in p-Pb and Pb-Pb collisions are presented. They help address the
question of whether there is evidence for global, flow-like, azimuthal
correlations in the p-Pb system. Comparisons are made to measurements from the
larger Pb-Pb system, where such evidence is established. In particular, the
second harmonic two-particle cumulants are found to decrease with multiplicity,
characteristic of a dominance of few-particle correlations in p-Pb collisions.
However, when a gap is placed to suppress such correlations,
the two-particle cumulants begin to rise at high-multiplicity, indicating the
presence of global azimuthal correlations. The Pb-Pb values are higher than the
p-Pb values at similar multiplicities. In both systems, the second harmonic
four-particle cumulants exhibit a transition from positive to negative values
when the multiplicity increases. The negative values allow for a measurement of
to be made, which is found to be higher in Pb-Pb collisions at
similar multiplicities. The second harmonic six-particle cumulants are also
found to be higher in Pb-Pb collisions. In Pb-Pb collisions, we generally find
which is indicative of a Bessel-Gaussian
function for the distribution. For very high-multiplicity Pb-Pb
collisions, we observe that the four- and six-particle cumulants become
consistent with 0. Finally, third harmonic two-particle cumulants in p-Pb and
Pb-Pb are measured. These are found to be similar for overlapping
multiplicities, when a gap is placed.Comment: 25 pages, 11 captioned figures, 3 tables, authors from page 20,
published version, figures at http://aliceinfo.cern.ch/ArtSubmission/node/87
Anisotropic flow of charged hadrons, pions and (anti-)protons measured at high transverse momentum in Pb-Pb collisions at TeV
The elliptic, , triangular, , and quadrangular, , azimuthal
anisotropic flow coefficients are measured for unidentified charged particles,
pions and (anti-)protons in Pb-Pb collisions at TeV
with the ALICE detector at the Large Hadron Collider. Results obtained with the
event plane and four-particle cumulant methods are reported for the
pseudo-rapidity range at different collision centralities and as a
function of transverse momentum, , out to GeV/.
The observed non-zero elliptic and triangular flow depends only weakly on
transverse momentum for GeV/. The small dependence
of the difference between elliptic flow results obtained from the event plane
and four-particle cumulant methods suggests a common origin of flow
fluctuations up to GeV/. The magnitude of the (anti-)proton
elliptic and triangular flow is larger than that of pions out to at least
GeV/ indicating that the particle type dependence persists out
to high .Comment: 16 pages, 5 captioned figures, authors from page 11, published
version, figures at http://aliceinfo.cern.ch/ArtSubmission/node/186
Centrality dependence of charged particle production at large transverse momentum in Pb-Pb collisions at TeV
The inclusive transverse momentum () distributions of primary
charged particles are measured in the pseudo-rapidity range as a
function of event centrality in Pb-Pb collisions at
TeV with ALICE at the LHC. The data are presented in the range
GeV/ for nine centrality intervals from 70-80% to 0-5%.
The Pb-Pb spectra are presented in terms of the nuclear modification factor
using a pp reference spectrum measured at the same collision
energy. We observe that the suppression of high- particles strongly
depends on event centrality. In central collisions (0-5%) the yield is most
suppressed with at -7 GeV/. Above
GeV/, there is a significant rise in the nuclear modification
factor, which reaches for GeV/. In
peripheral collisions (70-80%), the suppression is weaker with almost independently of . The measured nuclear
modification factors are compared to other measurements and model calculations.Comment: 17 pages, 4 captioned figures, 2 tables, authors from page 12,
published version, figures at
http://aliceinfo.cern.ch/ArtSubmission/node/284
- …