26,691 research outputs found
GRACE at ONE-LOOP: Automatic calculation of 1-loop diagrams in the electroweak theory with gauge parameter independence checks
We describe the main building blocks of a generic automated package for the
calculation of Feynman diagrams. These blocks include the generation and
creation of a model file, the graph generation, the symbolic calculation at an
intermediate level of the Dirac and tensor algebra, implementation of the loop
integrals, the generation of the matrix elements or helicity amplitudes,
methods for the phase space integrations and eventually the event generation.
The report focuses on the fully automated systems for the calculation of
physical processes based on the experience in developing GRACE-loop. As such, a
detailed description of the renormalisation procedure in the Standard Model is
given emphasizing the central role played by the non-linear gauge fixing
conditions for the construction of such automated codes. The need for such
gauges is better appreciated when it comes to devising efficient and powerful
algorithms for the reduction of the tensorial structures of the loop integrals.
A new technique for these reduction algorithms is described. Explicit formulae
for all two-point functions in a generalised non-linear gauge are given,
together with the complete set of counterterms. We also show how infrared
divergences are dealt with in the system. We give a comprehensive presentation
of some systematic test-runs which have been performed at the one-loop level
for a wide variety of two-to-two processes to show the validity of the gauge
check. These cover fermion-fermion scattering, gauge boson scattering into
fermions, gauge bosons and Higgs bosons scattering processes. Comparisons with
existing results on some one-loop computation in the Standard Model show
excellent agreement. We also briefly recount some recent development concerning
the calculation of mutli-leg one-loop corrections.Comment: 131 pages. Manuscript expanded quite substantially with the inclusion
of an overview of automatic systems for the calculation of Feynman diagrams
both at tree-level and one-loop. Other additions include issues of
regularisation, width effects and renormalisation with unstable particles and
reduction of 5- and 6-point functions. This is a preprint version, final
version to appear as a Phys. Re
swTVM: Exploring the Automated Compilation for Deep Learning on Sunway Architecture
The flourish of deep learning frameworks and hardware platforms has been
demanding an efficient compiler that can shield the diversity in both software
and hardware in order to provide application portability. Among the exiting
deep learning compilers, TVM is well known for its efficiency in code
generation and optimization across diverse hardware devices. In the meanwhile,
the Sunway many-core processor renders itself as a competitive candidate for
its attractive computational power in both scientific and deep learning
applications. This paper combines the trends in these two directions.
Specifically, we propose swTVM that extends the original TVM to support
ahead-of-time compilation for architecture requiring cross-compilation such as
Sunway. In addition, we leverage the architecture features during the
compilation such as core group for massive parallelism, DMA for high bandwidth
memory transfer and local device memory for data locality, in order to generate
efficient code for deep learning application on Sunway. The experimental
results show the ability of swTVM to automatically generate code for various
deep neural network models on Sunway. The performance of automatically
generated code for AlexNet and VGG-19 by swTVM achieves 6.71x and 2.45x speedup
on average than hand-optimized OpenACC implementations on convolution and fully
connected layers respectively. This work is the first attempt from the compiler
perspective to bridge the gap of deep learning and high performance
architecture particularly with productivity and efficiency in mind. We would
like to open source the implementation so that more people can embrace the
power of deep learning compiler and Sunway many-core processor
From Frequency to Meaning: Vector Space Models of Semantics
Computers understand very little of the meaning of human language. This
profoundly limits our ability to give instructions to computers, the ability of
computers to explain their actions to us, and the ability of computers to
analyse and process text. Vector space models (VSMs) of semantics are beginning
to address these limits. This paper surveys the use of VSMs for semantic
processing of text. We organize the literature on VSMs according to the
structure of the matrix in a VSM. There are currently three broad classes of
VSMs, based on term-document, word-context, and pair-pattern matrices, yielding
three classes of applications. We survey a broad range of applications in these
three categories and we take a detailed look at a specific open source project
in each category. Our goal in this survey is to show the breadth of
applications of VSMs for semantics, to provide a new perspective on VSMs for
those who are already familiar with the area, and to provide pointers into the
literature for those who are less familiar with the field
Tiramisu: A Polyhedral Compiler for Expressing Fast and Portable Code
This paper introduces Tiramisu, a polyhedral framework designed to generate
high performance code for multiple platforms including multicores, GPUs, and
distributed machines. Tiramisu introduces a scheduling language with novel
extensions to explicitly manage the complexities that arise when targeting
these systems. The framework is designed for the areas of image processing,
stencils, linear algebra and deep learning. Tiramisu has two main features: it
relies on a flexible representation based on the polyhedral model and it has a
rich scheduling language allowing fine-grained control of optimizations.
Tiramisu uses a four-level intermediate representation that allows full
separation between the algorithms, loop transformations, data layouts, and
communication. This separation simplifies targeting multiple hardware
architectures with the same algorithm. We evaluate Tiramisu by writing a set of
image processing, deep learning, and linear algebra benchmarks and compare them
with state-of-the-art compilers and hand-tuned libraries. We show that Tiramisu
matches or outperforms existing compilers and libraries on different hardware
architectures, including multicore CPUs, GPUs, and distributed machines.Comment: arXiv admin note: substantial text overlap with arXiv:1803.0041
Automatic Computation of Feynman Diagrams
Quantum corrections significantly influence the quantities observed in modern
particle physics. The corresponding theoretical computations are usually quite
lengthy which makes their automation mandatory. This review reports on the
current status of automatic calculation of Feynman diagrams in particle
physics. The most important theoretical techniques are introduced and their
usefulness is demonstrated with the help of simple examples. A survey over
frequently used programs and packages is provided, discussing their abilities
and fields of applications. Subsequently, some powerful packages which have
already been applied to important physical problems are described in more
detail. The review closes with the discussion of a few typical applications for
the automated computation of Feynman diagrams, addressing current physical
questions like properties of the and Higgs boson, four-loop corrections to
renormalization group functions and two-loop electroweak corrections.Comment: Latex, 62 pages. Typos corrected, references updated and some
comments added. Vertical offset changed. The complete paper is also available
via anonymous ftp at ftp://ttpux2.physik.uni-karlsruhe.de/ttp98/ttp98-41/ or
via www at http://www-ttp.physik.uni-karlsruhe.de/Preprints
- …