Search CORE

176 research outputs found

NNLO QCD $\times$ EW corrections to Z production in the $q\bar{q}$ channel

Author: Bonciani Roberto
Buccioni Federico
Rana Narayan
Triscari Ilario
Vicini Alessandro
Publication venue: 'American Physical Society (APS)'
Publication date: 14/11/2019
Field of study

We present the first results for the

{\cal O}(\alpha\alpha_s)

corrections to the total partonic cross section of the process

q\bar q\to Z+X

, with the complete set of contributions, that include photonic and massive weak gauge boson effects. The results are relevant for the precise determination of the hadronic

Z

boson production cross section. Virtual and real corrections are calculated analytically using the reduction to the master integrals and their evaluation through differential equations. Real corrections are dealt with using the reverse-unitarity method. They require the evaluation of a new set of two-loop master integrals, with up to three internal massive lines. In particular, three of them are expressed in terms of elliptic integrals. We verify the absence, at this perturbative order, of initial state mass singularities proportional to a weak massive virtual correction to the quark-gluon splitting.Comment: 6 pages, 1 figur

arXiv.org e-Print Archive

Oxford University Research Archive

CERN Document Server

APEnet+: a 3D toroidal network enabling Petaflops scale Lattice QCD simulations on commodity clusters

Author: Ammendola Roberto
Biagioni Andrea
Cicero Francesca Lo
Frezza Ottorino
Lonardo Alessandro
Paolucci Pier
Petronzio Roberto
Rossetti Davide
Salamon Andrea
Salina Gaetano
Simula Francesco
Tantalo Nazario
Tosoratto Laura
Vicini Piero
Publication venue
Publication date: 01/01/2010
Field of study

Many scientific computations need multi-node parallelism for matching up both space (memory) and time (speed) ever-increasing requirements. The use of GPUs as accelerators introduces yet another level of complexity for the programmer and may potentially result in large overheads due to the complex memory hierarchy. Additionally, top-notch problems may easily employ more than a Petaflops of sustained computing power, requiring thousands of GPUs orchestrated with some parallel programming model. Here we describe APEnet+, the new generation of our interconnect, which scales up to tens of thousands of nodes with linear cost, thus improving the price/performance ratio on large clusters. The project target is the development of the Apelink+ host adapter featuring a low latency, high bandwidth direct network, state-of-the-art wire speeds on the links and a PCIe X8 gen2 host interface. It features hardware support for the RDMA programming model and experimental acceleration of GPU networking. A Linux kernel driver, a set of low-level RDMA APIs and an OpenMPI library driver are available, allowing for painless porting of standard applications. Finally, we give an insight of future work and intended developments

arXiv.org e-Print Archive

ART

NaNet:a low-latency NIC enabling GPU-based, real-time low level trigger systems

Author: Ammendola Roberto
Biagioni Andrea
Cicero Francesca Lo
Fantechi Riccardo
Frezza Ottorino
Lamanna Gianluca
Lonardo Alessandro
Pantaleo Felice
Paolucci Pier Stanislao
Piandani Roberto
Pontisso Luca
Rossetti Davide
Simula Francesco
Sozzi Marco
Tosoratto Laura
Vicini Piero
Publication venue: 'IOP Publishing'
Publication date: 05/11/2013
Field of study

We implemented the NaNet FPGA-based PCI2 Gen2 GbE/APElink NIC, featuring GPUDirect RDMA capabilities and UDP protocol management offloading. NaNet is able to receive a UDP input data stream from its GbE interface and redirect it, without any intermediate buffering or CPU intervention, to the memory of a Fermi/Kepler GPU hosted on the same PCIe bus, provided that the two devices share the same upstream root complex. Synthetic benchmarks for latency and bandwidth are presented. We describe how NaNet can be employed in the prototype of the GPU-based RICH low-level trigger processor of the NA62 CERN experiment, to implement the data link between the TEL62 readout boards and the low level trigger processor. Results for the throughput and latency of the integrated system are presented and discussed.Comment: Proceedings for the 20th International Conference on Computing in High Energy and Nuclear Physics (CHEP

arXiv.org e-Print Archive

Crossref

CERN Document Server