26,691 research outputs found

    GRACE at ONE-LOOP: Automatic calculation of 1-loop diagrams in the electroweak theory with gauge parameter independence checks

    Full text link
    We describe the main building blocks of a generic automated package for the calculation of Feynman diagrams. These blocks include the generation and creation of a model file, the graph generation, the symbolic calculation at an intermediate level of the Dirac and tensor algebra, implementation of the loop integrals, the generation of the matrix elements or helicity amplitudes, methods for the phase space integrations and eventually the event generation. The report focuses on the fully automated systems for the calculation of physical processes based on the experience in developing GRACE-loop. As such, a detailed description of the renormalisation procedure in the Standard Model is given emphasizing the central role played by the non-linear gauge fixing conditions for the construction of such automated codes. The need for such gauges is better appreciated when it comes to devising efficient and powerful algorithms for the reduction of the tensorial structures of the loop integrals. A new technique for these reduction algorithms is described. Explicit formulae for all two-point functions in a generalised non-linear gauge are given, together with the complete set of counterterms. We also show how infrared divergences are dealt with in the system. We give a comprehensive presentation of some systematic test-runs which have been performed at the one-loop level for a wide variety of two-to-two processes to show the validity of the gauge check. These cover fermion-fermion scattering, gauge boson scattering into fermions, gauge bosons and Higgs bosons scattering processes. Comparisons with existing results on some one-loop computation in the Standard Model show excellent agreement. We also briefly recount some recent development concerning the calculation of mutli-leg one-loop corrections.Comment: 131 pages. Manuscript expanded quite substantially with the inclusion of an overview of automatic systems for the calculation of Feynman diagrams both at tree-level and one-loop. Other additions include issues of regularisation, width effects and renormalisation with unstable particles and reduction of 5- and 6-point functions. This is a preprint version, final version to appear as a Phys. Re

    swTVM: Exploring the Automated Compilation for Deep Learning on Sunway Architecture

    Full text link
    The flourish of deep learning frameworks and hardware platforms has been demanding an efficient compiler that can shield the diversity in both software and hardware in order to provide application portability. Among the exiting deep learning compilers, TVM is well known for its efficiency in code generation and optimization across diverse hardware devices. In the meanwhile, the Sunway many-core processor renders itself as a competitive candidate for its attractive computational power in both scientific and deep learning applications. This paper combines the trends in these two directions. Specifically, we propose swTVM that extends the original TVM to support ahead-of-time compilation for architecture requiring cross-compilation such as Sunway. In addition, we leverage the architecture features during the compilation such as core group for massive parallelism, DMA for high bandwidth memory transfer and local device memory for data locality, in order to generate efficient code for deep learning application on Sunway. The experimental results show the ability of swTVM to automatically generate code for various deep neural network models on Sunway. The performance of automatically generated code for AlexNet and VGG-19 by swTVM achieves 6.71x and 2.45x speedup on average than hand-optimized OpenACC implementations on convolution and fully connected layers respectively. This work is the first attempt from the compiler perspective to bridge the gap of deep learning and high performance architecture particularly with productivity and efficiency in mind. We would like to open source the implementation so that more people can embrace the power of deep learning compiler and Sunway many-core processor

    From Frequency to Meaning: Vector Space Models of Semantics

    Full text link
    Computers understand very little of the meaning of human language. This profoundly limits our ability to give instructions to computers, the ability of computers to explain their actions to us, and the ability of computers to analyse and process text. Vector space models (VSMs) of semantics are beginning to address these limits. This paper surveys the use of VSMs for semantic processing of text. We organize the literature on VSMs according to the structure of the matrix in a VSM. There are currently three broad classes of VSMs, based on term-document, word-context, and pair-pattern matrices, yielding three classes of applications. We survey a broad range of applications in these three categories and we take a detailed look at a specific open source project in each category. Our goal in this survey is to show the breadth of applications of VSMs for semantics, to provide a new perspective on VSMs for those who are already familiar with the area, and to provide pointers into the literature for those who are less familiar with the field

    Tiramisu: A Polyhedral Compiler for Expressing Fast and Portable Code

    Full text link
    This paper introduces Tiramisu, a polyhedral framework designed to generate high performance code for multiple platforms including multicores, GPUs, and distributed machines. Tiramisu introduces a scheduling language with novel extensions to explicitly manage the complexities that arise when targeting these systems. The framework is designed for the areas of image processing, stencils, linear algebra and deep learning. Tiramisu has two main features: it relies on a flexible representation based on the polyhedral model and it has a rich scheduling language allowing fine-grained control of optimizations. Tiramisu uses a four-level intermediate representation that allows full separation between the algorithms, loop transformations, data layouts, and communication. This separation simplifies targeting multiple hardware architectures with the same algorithm. We evaluate Tiramisu by writing a set of image processing, deep learning, and linear algebra benchmarks and compare them with state-of-the-art compilers and hand-tuned libraries. We show that Tiramisu matches or outperforms existing compilers and libraries on different hardware architectures, including multicore CPUs, GPUs, and distributed machines.Comment: arXiv admin note: substantial text overlap with arXiv:1803.0041

    Automatic Computation of Feynman Diagrams

    Full text link
    Quantum corrections significantly influence the quantities observed in modern particle physics. The corresponding theoretical computations are usually quite lengthy which makes their automation mandatory. This review reports on the current status of automatic calculation of Feynman diagrams in particle physics. The most important theoretical techniques are introduced and their usefulness is demonstrated with the help of simple examples. A survey over frequently used programs and packages is provided, discussing their abilities and fields of applications. Subsequently, some powerful packages which have already been applied to important physical problems are described in more detail. The review closes with the discussion of a few typical applications for the automated computation of Feynman diagrams, addressing current physical questions like properties of the ZZ and Higgs boson, four-loop corrections to renormalization group functions and two-loop electroweak corrections.Comment: Latex, 62 pages. Typos corrected, references updated and some comments added. Vertical offset changed. The complete paper is also available via anonymous ftp at ftp://ttpux2.physik.uni-karlsruhe.de/ttp98/ttp98-41/ or via www at http://www-ttp.physik.uni-karlsruhe.de/Preprints
    corecore